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i Abstract. We revisit and reinterpret Selten's concept of subgame perfectness 

in the context of single agent normal form sequential decision making, which 
leads us to the concept of subtree perfectness. Thereby, we extend Hammond's 
characterization of extensive form consequentialist consistent behaviour norms 
to the normal form and to arbitrary choice functions under very few assump- 
tions. In particular, we do not need to assume probabilities on any event or 
utilities on any reward. We show that subtree perfectness is equivalent to 
normal-extensive form equivalence, and is sufficient, but, perhaps surprisingly, 
^ (— I not necessary, for backward induction to work. 
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■ 1. Introduction 

^— ^ ' In a single agent sequential decision problem, at any stage, one has two ways 

\q , of looking at its solution: the problem can be considered either in its simplest 

£C) ' form — discarding any past stages, or as part of a much larger problem — possibly 

, considering choices and events that did not actually obtain. A reasonable require- 

ment is that, at any stage, the solution is independent of the larger problem it is 
embedded in. In this paper, we call this requirement subtree perfectness. 

Sclten |23] introduced a similar idea for multi-agent extensive form games, called 
subgame perfectness. In such games, players decide sequentially, and a player's 
behaviour strategy specifies a decision at each point where he must choose. An 
equilibrium point is a strategy for each player, such that no player would change 
their strategy if they knew the strategy of all others. 

A subgame is a part of a game that is again a game. For example, if a game 
consists of four rounds, then after having played two rounds, one can consider the 
remaining two rounds as a separate game. An equilibrium point of the full game 
of course implies a strategy for each player in the subgame simply by restricting 
the strategy of each player in the full game to only that subgame. If, for every 
subgame, the restriction of the equilibrium point of the full game to that subgame 
also yields an equilibrium point of that subgame, then the equilibrium point of the 
full game is called subgame perfect. 
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Figure 1. Two- Figure 2. Second stage, 

stage problem. 

Selten showed that for games with perfect recall, (perfeclQ) equilibrium points 
are subgame perfect [531 P- 39, Thm. 2], that is, they are independent of any larger 
game in which they could be embedded. 

As mentioned, in this paper we investigate single agent sequential decision mak- 
ing modelled by decision trees. Although such problems differ in many ways from 
extensive form games, subtree perfectness is clearly analogous to subgame perfect- 
ness, as the following example shows. 

Consider the decision problem in Fig. [TJ In the first stage, the subject chooses 
between taking scones, or proceeding to the second stage. In the second stage, 
the subject chooses between cake, or ice cream. Suppose the subject prefers to 
reject scones and to choose ice cream at the second stage. This strategy induces a 
substrategy in the subtree for the second stage: choose ice cream over cake. 

But, as with multi-agent games, we can instead consider the subtree for the sec- 
ond stage separately, as in Fig. [2j If, in this smaller tree, the subject prefers ice 
cream, then his solution is subtree perfect: his solution for the full tree induces a 
strategy in the subtree, and this strategy coincides with his solution for the subtree. 
If the subject states a difference preference (either no preference, or clear prefer- 
ence for cake), then his solution lacks subtree perfectness. So, subtree perfectness 
essentially means that the optimal induced strategies in a subtree do not depend 
on the full tree in which the subtree is embedded. 

Our goal is to determine the conditions under which a theory of choice is subtree 
perfect. We consider a very large class of theories of choice, namely any that can 
be represented by conditional choice functions on gambles: we merely assume that 
for any set of gambles (functions from the possibility space O to a set of rewards 1Z; 
these generalize random variables, or horse lotteries), and any conditioning event, 
the subject can give a non-empty subset of gambles that he considers optimal. 
Gambles that are non-optimal would never be selected, and the subject is unable 
to express further preference between the optimal ones. Maximizing expected utility 
is a simple example of such a choice function. 

General choice functions, however, need neither probability nor utility — not even 
a preorder. Consequently, we must take care to distinguish clearly between the nor- 
mal form and the extensive form of a decision problem (or game) . While these forms 
are equivalent when maximizing expected utility [20[ 1.3.4], their equivalence breaks 
down when expected utility is abandoned, as shown by for instance Seidenfeld [3T] , 
Machina [16], and Jaffray [9], among others. 

Further, the standard definitions of normal and extensive form are not compat- 
ible with many choice functions. The usual extensive form solution for a decision 



1 Without going into much detail, a perfect equilibrium point is one which is stable under small 
perturbations [23] p. 38]. 
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tree, as given by Raiffa and Schlaifer [20, 1.2.1], involves using backward induction 
to replace subtrees with their maximum expected utility. Clearly, this definition is 
too restrictive for general choice functions, particularly those who do not correspond 
to a total preorder. Therefore, instead, we use the terms normal and extensive form 
in a far more general sense, while retaining their core features: the normal form 
specifies all actions in all eventualities in advance, whereas the extensive form spec- 
ifies actions locally at each decision node. 

Our main results start out from the traditional normal form method of listing 
all strategies, finding their corresponding gambles, applying a choice function, and 
listing all strategies that induce optimal gambles. In doing so, we assume act-state 
independence — that is, choice functions do not depend on the decision. Dropping 
act-state independence is beyond the scope of this paper. Under this assumption, 
we find three necessary and sufficient conditions for a choice function to induce 
a subtree perfect normal form solution. Further, we show that a normal form 
solution induced by a choice function satisfying these conditions will always have 
an equivalent extensive form representation. Interestingly, this does not hold for 
subtree perfect normal form solutions in general, only those induced by a choice 
function. 

Our results are very similar to those of Hammond [BJ, although it may appear 
that Hammond's goal is quite different from ours. Hammond considers extensive 
form solutions only, calls them behaviour norms [6j p. 28], and defines a notion 
of consistency which corresponds exactly to what we call subtree perfectness for 
extensive form solutions. Two trees are called strategically equivalent [16| p. 1636] 
if their normal form decisions (or, strategies) induce the same set of gambles. Ham- 
mond calls a behaviour norm consequentialist if it preserves strategic equivalence, 
that is, if strategically equivalent trees also have strategically equivalent behaviour 
norms. Hammond shows that a consistent and consequentialist behaviour norm 
implies a choice function on gambles, and then investigates the properties of this 
choice function. Although our goal, to find necessary and sufficient conditions on 
arbitrary choice functions under which they induce subtree perfect normal form 
solutions, is apparently different, Hammond's approach effectively amounts to the 
same thing, but starting from extensive form, and only for a particular class of 
choice functions. 

Even more so, because, as we will show, subtree perfectness implies normal- 
extensive form equivalence, unsurprisingly, Hammond's necessary and sufficient 
conditions on a choice function for it to be implied by a consistent and consequen- 
tialist behaviour norm, are very similar to our necessary and sufficient conditions 
for subtree perfectness. Hammond's first condition, that the choice function must 
induce a total preorder, is identical to our Property [2j His second, the indepen- 
dence axiom, is not present in our work because it only arises if probabilities can be 
assigned to certain chance arcs, an assumption we do not make. His third, a type 
of sure-thing principle, is very similar to our Property [3j where the difference is 
again based on whether or not probabilities are admitted. Finally, our Property [1] 
is not present in Hammond's account, because of a small difference in the definition 
of decision trees. 

Since our study is closely related to Hammond's work, and makes use of many 
similar ideas, we shall throughout note where our definitions and properties coincide 
with those of Hammond, and also discuss the implications of the differences in 
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approach. It turns out that the only significant difference is dependent upon exactly 
how consequentialism is defined. Hammond writes a description of conscquentialism 
that turns out to be essentially identical to our approach, but his mathematical 
definition that he uses in all proofs is actually a weaker version, with which one 
cannot prove results quite as strong as ours. In Theorem 1371 we unify our work 
with Hammond's informal but strong definition. 

Subtree perfectness also relates to Machina's 16, p. 1627] concept of separability 
over mutually exclusive events, which essentially boils down to subtree perfectness 
for probability trees. Although Machina defines separability using probabilities and 
a total preorder, it can be easily generalized for arbitrary choice functions and more 
general gambles that do not involve probabilities. All such possible generalizations 
seem to be equivalent to, or implied by, our Property El 

A different concept called separability is defined by McClennen [TTJ p. 122]. Un- 
der the assumption of dynamic consistency [171 p. 120], this form of separability 
proves to be equivalent to subtree perfectness. The condition of dynamic consis- 
tency is implicitly assumed in our solutions. Although once again McClennen's 
decision trees are somewhat different from ours, his Theorems 8.1 and 8.2 are very 
similar to our Lemma l2Tl and indeed the decision trees used in the proofs are almost 
identical. 

Subtree perfectness is strongly linked to backward induction methods for solv- 
ing decision trees. Indeed, under a total preorder, failures of backward induction 
are often used to illustrate violations of subtree perfectness [HI IE] ■ We find that 
subtree perfectness is sufficient, but, perhaps surprisingly, not necessary, for back- 
ward induction to work. In particular, a total preorder is not required. Further, 
backward induction implies a weaker form of subtree perfectness, in which every 
strategy must induce an optimal substrategy in every subtree, but the set of opti- 
mal strategies is not required to induce the set of all optimal substrategies in every 
subtree. 

The paper is structured as follows: Section [2] explains decision trees and intro- 
duces notation. Section provides a careful definition of normal and extensive form 
solutions, and introduces the concept of gambles to more easily work with normal 
form solutions. Section [4] introduces choice functions and their relationship with 
normal form solutions. Section [5] defines subtree perfectness and contains the prin- 
cipal results. Section [6] explores when a backward induction method can be used 
to find the normal form solution induced by a choice function. Section [7] examines 
the relationships subtree perfectness has with backward induction, extensive form 
equivalence, and the ideas of Hammond, McClennen, and Machina. 

2. Decision Trees 

In this section, we explain the basic ideas behind decision trees and notation 
which will be used throughout the rest of the paper. For more information about 
decision trees, we refer to the literature [HI [3] . 

2.1. Definition and Example. A decision tree consists of a rooted tree [51 p. 92, 
Sec. 3.2] of decision nodes, chance nodes, and reward leaves, growing from left to 
right. The left hand side corresponds to what happens first, and the right hand 
side to what happens last. The payoffs for each sequence of decisions and events 
are at the end. 



Figure 3. Payoff table and decision tree for walking in the lake district. 



Consider the following example. Tomorrow, a subject is going for a walk in 
the lake district. It may rain (2?i), or not (i^)- The subject can either take a 
waterproof (di), or not (g^), and can also choose to buy today's newspaper to 
learn tomorrow's weather forecast (ds), or not (<%). Suppose that the forecast can 
have either two outcomes: predicting rain (Si), or not (S?)- The utility of each 
combination, if the subject does not buy the newspaper, is summarized in Figure [3] 
(left). If the subject buys the newspaper, then one utile is subtracted. 

Figure [3] also depicts the decision tree. Squares are decision nodes, and circles 
are chance nodes. From each node, branches emerge. For decision nodes, these are 
decisions; for chance nodes, these are events. For each chance node, the events form 
a partition of the possibility space, contrary to Hammond [51 p. 31], where events 
at chance nodes form a partition of the set of all states of nature still possible 
upon reaching the chance node. This leads to several technical differences, that 
will be seen in Definition [8j Property [TJ and Property [7] Furthermore, our chance 
nodes are called natural nodes by Hammond. Hammond's chance nodes involve 
probabilities and do not appear in our decision trees. 

In our example, the subject must first decide whether to buy the newspaper or 
not, hence we start with a decision node. If he buys the newspaper (ds), then 
he learns about tomorrow's forecast. Thus, the chance node following ds has two 
branches, forecasting rain (Si), or no rain (82)- Next, when leaving for the trip, he 
can either bring his waterproof (di) or not (^2), hence the decision node following 
Si. During the walk, a chance node yields rain (Ei) or not (E 2 ). 

So, each path in a decision tree amounts to a particular sequence of decisions 
and events. The payoffs resulting from each such sequence are put at the end. 

2.2. Notation. Before elaborating how decision trees can be solved, we introduce 
a convenient mathematical notation for them. Decision trees can be seen as com- 
binations of smaller decision trees: for instance, in the lake district example, one 
could draw the subtree corresponding to buying the newspaper, and also draw the 
subtree corresponding to making an immediate decision. The decision tree for the 
full problem is then formed by joining these two subtrees at a decision node. 

Hence, we can represent a decision tree as follows. Let Ti, . . . , T n be decision 
trees and E t , . . . , E n be a partition of the possibility space. If T is formed by 
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combining the trees at a decision node, we write 

n 

t=\_\t 1 . 

1=1 

If T is formed by combining the trees at a chance node, with subtree T t being 
connected by event E i , we write 

n 

T = QE i T i . 

i=i 

For instance, for the tree of Fig. [3j we write 

(Si(Ti U T 2 ) S 2 (Ti U T 2 )) U U U 2 ) 

with 

Ti = £a9o£ 2 14 Ui = SilO £ 2 15 T 2 = Ei4 £ 2 19 f/ 2 = Si 5 E 2 2Q 

In certain circumstances it will be necessary to consider decision trees formed by 
adding a decision node in front of a tree T, that is, a decision tree whose root is a 
decision node with one option. Such a tree shall be denoted by UT. 

In this paper we shall often be considering subtrees of larger trees. For subtrees, 
we need to know the events that were observed in the past. Two subtrees with 
the same configuration of nodes and arcs may have different preceding events, and 
should be treated differently. Therefore we associate with every decision tree T an 
event ev(T): the intersection of all the events on chance arcs that have preceded 
T. Hammond p. 27] denotes these events by S(n). 

Definition 1. A subtree of a tree T obtained by removal of all non-descendants of 
a particular node N is called the subtree of T at N and is denoted by st^(T). 

Subtrees are called continuation trees by Hammond [B]. 

3. Solving Decision Trees 

This paper deals with more general solutions of decision trees than are usually 
considered. Consequently, the usual definitions of extensive and normal forms, such 
as in Raiffa and Schlaifer [2U], are insufficient. Therefore, we first carefully define 
normal and extensive form solutions. 

3.1. Extensive and Normal Form Solutions. The usual definition of extensive 
form is based on backward induction and expected utility. At each ultimate decision 
node, we calculate the expected utility of each option, and choose a maximal arc, 
replacing that node by its maximum expected utility. The penultimate decision 
nodes have now become the ultimate ones, so we can repeat this process, until the 
root node. 

If options are not assigned values, then this type of backward induction cannot 
be used. Therefore, we abandon the link with backward induction, and instead 
focus on another property of extensive form: the decision arc to follow only needs 
to be specified when the subject actually reaches the decision node. 

An extensive form solution of a decision tree removes from each decision node 
some (possibly none), but not all, of the decision arcs. So, an extensive form 
solution is a subtree of the original decision tree, where at each decision node a 
non-empty subset of arcs is retained. For instance, in the lake district example, 
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one of the extensive form solutions is: do not buy the newspaper, and then either 
take the waterproof or not. An extensive form solution can be used as follows: the 
subject, upon reaching a decision node, chooses one of the arcs in the extensive 
form solution, and follows it. The subject only needs to decide which arc to follow 
at a decision node when reaching that node. 

In contrast, normal form solutions specify all future decisions at the start, so 
no further decisions need to be taken as time progresses. Classically, normal form 
solutions involve listing all possible combinations of actions and then choosing one 
to maximize expected utility. This generalizes as follows. 

First, an extensive form solution with just one arc out of each decision node, 
is called a normal form decision. For instance, in the example, one of the normal 
form decisions is: buy the newspaper, and take the waterproof if the newspaper 
predicts rain, but do not take the waterproof otherwise. We denote the set of all 
normal form decisions for a decision tree T by nfd(T). Normal form decisions are 
also called strategies, pure strategies, plans, and policies. 

A normal form solution of a decision tree T is then simply a subset of nfd(T). 
The interpretation of this subset is that the subject picks one of the normal form 
decisions of the normal form solution, and then acts accordingly. 

It is natural to ask whether solutions of one form can be meaningfully trans- 
formed into solutions of the other. This question is addressed in Section 17.21 For 
now, note only that there are usually more normal form solutions than there are ex- 
tensive form solutions. For example, consider the tree below, 
and suppose a normal form solution contains two normal form 
decisions, namely d\d\ and d\d\. Any attempt to find a cor- 
responding extensive form solution will have to include all four i 

decision arcs, but that would also correspond to the normal form 
solution where all four normal form decisions are present. There- v_y \ ,H 




fore there is no one-to-one correspondence between extensive and 
normal form solutions. 

These two forms of solution are not the only possibilities. For instance, McClen- 
nen [17j considers a form of solution where the subject must give a set of optimal 
normal form decisions at every node in the tree. The interpretation is apparently 
that the subject chooses a plan every time he reaches a node. We shall call such 
a solution a dynamic normal form solution. Our normal form solutions are special 
cases of dynamic normal form solutions, where the plan at any node is simply the 
restriction, to the node in question, of the plan at the root node (McClennen calls 
this a dynamically consistent solution). 

3.2. Extensive and Normal Form Operators. An extensive form operator is 
a function which maps each decision tree to an extensive form solution of that 
decision tree. The method by which decision arcs are removed is not specified, and 
in particular, need not be related to backward induction. Hammond [6, p. 28] calls 
these operators behaviour norms. 

A normal form operator maps each decision tree to a normal form solution of 
that tree. Again, the method by which this happens is not part of our definition. 

These operators are usually interpreted as describing optimality. 

Clearly, the classical extensive and normal form interpretations are, respectively, 
extensive and normal form operators. Classical backward induction deletes any arc 
with non-maximal expectation, and results in a tree with some arcs deleted: an 
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LOl CJ2 U>3 UJ4, 



Si9 0S 2 14 9 9 14 14 
Si (£ , i9 0_B 2 14) ®S 2 (£i4e£ 2 19) 9 4 14 19 
Table 1. Example of normal form gambles. 



extensive form solution. The classical normal form finds the set of normal form 
decisions with maximal expectation: a normal form solution. 

3.3. Gambles. To express normal form decisions and solutions efficiently, we first 
introduce some definitions and notation. Let Q be the possibility space: the set of 
all possible states of the world. We only consider finite possibility spaces. Elements 
of fi are called typically denoted by ui. Subsets of fi are called events. Let 1Z be a 
set of rewards. For our results we need not assume 1Z = M. 

A gamble is a function X : fl — > 1Z, and is interpreted as an uncertain reward: 
once ui £ fi is observed, X yields X(ui). Probabilities over f2 are not needed. 

3.4. Normal Form Gambles. Once a normal form decision is chosen, the reward 
is determined entirely by the events that obtain. So, every normal form decision 
has a corresponding gamble, which we call a normal form gamble. The set of all 
normal form gambles associated with a decision tree T is denoted by gamb(T). 

Using Fig. [3l we explain how to find normal form gambles. First consider the 
subtree at N^, which has normal form decisions d\ and <i 2 . The former gives 
reward 9 utiles if uj S E\ and 14 utiles if u> £ E2 , and so yields the gamble 



The © operator combines partial maps defined on disjoint domains (i.e. the partial 
map -Ei 9 defined on E\, and the partial map E 2 14 defined on E 2 ). 

Now consider the subtree with root at N t , and in particular the normal form 
decision l d\ if Si and d 2 if S2 '■ This gives reward 9 if u> £ Si D E\, reward 14 if 
u> G Si H E2 , and so on. The corresponding gamble is 



where multiplication with an event is now understood to correspond to restriction, 
i.e., 9 is a constant map on 51, Ei9 is a constant map restricted to E\, and Si(Ea9) 
is obtained from Ei9 by further restriction to E\ (iSi. For illustration, we tabulate 
the values of some normal form gambles in Table [1] where Q — {u>i, U2, UJ3, W4}, 
Ei = {uii,U) 2 }, and Si = {uj 17 uj 3 }. 

Observe that the gamble in Eq. ([2]) incorporates the gamble in Eq. (fTJ). This 
observation allows a very convenient recursive definition of the gamb operator. 

Definition 2. For any events E%, . . . , E n which form a partition, and any finite 
family of sets of gambles X\, . . . , X n , we define the following set of gambles: 



(1) 



Ei9©£: 2 14. 



(Si n £i)9 © (Si n # 2 )14 © (s 2 n e x )a © (s 2 n e 2 )19, 

or briefly, if we omit 'fY and employ distributivity, 

(2) Si(Ei9fflE 2 14)©S 2 (Ei4©£; 2 19), 



(3) 
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Definition 3. With any decision tree T, we associate a set of gambles gamb(T), 
recursively defined through: 

• If a tree T consists of only a leaf with reward r 6 1Z, then 
(4a) gamb(T) = {r}. 

• If a tree T has a chance node as root, that is, T = O i=1 EiT^, then 

(n \ n 

Qe.tA = @ J E i gamb(T i ). 
i=l / i=l 

• If a tree T has a decision node as root, that is, if T = |J" =1 Tj, then 

/ n \ n 

(4c) gamb |J = |J S amb (^). 



\i=l 



It is easy to show that this definition gives indeed the required set, 
(5) gamb(T) = |J gamb(C7). 

[/Gnfd(T) 

Hammond uses F(T,n) for gamb(stA/-(T)). McClennen uses G{T) for gamb(T). 

Multiple decision trees can model the same problem. This suggests the following 
definition (see for instance p~2l H6] ): 

Definition 4. Two decision trees T\ and Ti are called strategically equivalent if 
gamb(T 1 ) = gamb(T 2 ). 

Hammond p. 38] uses the term consequentially equivalent. 

4. Normal Form Solutions for Decision Trees 

4.1. Choice Functions and Optimality. We defined a normal form solution as 
a subset of all normal form decisions. Ideally one would like to identify a single 
best normal form decision, but this is not always possible. The subject might, 
however, still be able to eliminate some normal form decisions that he would never 
consider choosing, leaving a subset of normal form decisions. We say that the 
subject considers these as optimal. 

In classical decision theory, each normal form decision induces a random real- 
valued gain, and is considered optimal if its expected gain is maximized. As another 
example, suppose that a set M of plausible probability distributions are specified. 
Then the subject might consider optimal all normal form decisions whose expected 
gain is maximal under at least one distribution in M . 

So, often, optimal decisions are determined by comparison of gambles. We follow 
this common approach, since normal form decisions have corresponding gambles, 
and gambles are easier to work with. We therefore suppose that the subject can 
determine an optimal subset of any set of gambles, conditional upon an event A 
(corresponding to ev(T) of the tree T in question): 

Definition 5. A choice function opt maps, for any non-empty event A, each non- 
empty finite set X of gambles to a non-empty subset of this set: 

^ opt(X\A) C X. 
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Note that common uses of choice functions in social choice theory, such as by 
Sen [24l p. 63, 11. 19-21] do not consider conditioning, and define choice functions 
for arbitrary sets of options (not for gambles only) . 

4.2. Normal Form Operator Induced by a Choice Function. Now, given a 
choice function opt, we naturally arrive at a normal form operator norm opt , simply 
by applying opt on the set of all gambles associated with the tree T and then finding 
the corresponding set of normal form decisions. 

Definition 6. Given any choice function opt, and any decision tree T with ev(T) ^ 
0, we define 

normopt (T) = {U e nfd(T) : gamb(*7) C opt(gamb(T)|ev(T))}. 

Of course, since the U are normal form decisions, gamb(t/) is always a singleton 
in this definition. In particular, the following important equality holds, 

(6) gamb(norm opt (T)) = opt(gamb(T)|ev(T)). 

It follows immediately that norm opt respects strategic equivalence: 

Theorem 7. If Tj and T2 are strategically equivalent and ev(Ti) = ev(T2) 7^ 0, 
then gamb(norm opt (Ti)) = garnb(norm pt(T 2 )). 

This is of course an attractive property, as there are always many strategically 
equivalent trees representing the same problem: Theorem [7] guarantees that all 
equivalent representations yield the same solution. 

When studying subtree perfectness, we consider norm opt for arbitrary subtrees. 
To ensure that norm opt can be applied on each of these, we need: 

Definition 8. A decision tree T is called consistent if for every node N of T , 

ev(stjv(T)) ^ 0. 

Clearly, if a decision tree T is consistent, then for any node N in T, stjy(T) 
is also consistent. We study only consistent decision trees because we consider 
norm opt (stjv(T)) for any node N in T, which is impossible when ev(stjv(T')) = 0. 

Usually, one does not consider events which conflict with preceding events, hence 
consistency is satisfied. However, due to an oversight, some branch of a chance 
node might represent an event that cannot occur: such tree can always be made 
consistent by removing those nodes whose conditioning event is empty. 

Not all sets of gambles can be represented by a consistent decision tree: 

Definition 9. Let A be any non-empty event, and let X be a non-empty finite set 
of gambles. Then the following conditions are equivalent; if any (hence all) of them 
are satisfied, we say that X is A-consistent. 

(A) There is a consistent decision tree T with ev(T) = A and gamb(T) = X . 

(B) For every r £ 1Z and every X £ X such that X _1 (r) ^ 0, it holds that 
X- 1 (r)nA^0. 

We will also say that a gamble X is A-consistent whenever {X} is A- consistent. 

5. Subtree Perfectness 

We now define subtree perfectness, for both types of operator, and find necessary 
and sufficient conditions on opt for norm opt to be subtree perfect. 
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Figure 4. For a subtree perfect extensive form operator, opti- 
mization and restriction commute. 

5.1. Example and Definition. First, we illustrate subtree perfectness by an ex- 
ample. Suppose we apply an extensive form operator to the tree T in Fig. [31 This 
operator will delete some (possibly none) of the decision arcs at N = N 1 \ . If the 
operator would delete the same arcs at N regardless of the larger tree in which 
stjy(T) is embedded, then the operator is subtree perfect. If the operator does not 
have this property (for instance, if the solution of T after N were to depend on con- 
sequences of dg or S2), then it fails subtree perfectness. Hammond 6, p. 34] calls 
subtree perfectness for behaviour norms (extensive form operators) consistency. 

The definition for subtree perfectness for a normal form operator requires the 
following extension to Definition Q] 

Definition 10. If T is a set of decision trees and N a node, then 



Definition 11. An extensive form operator ext is called subtree perfect if for every 
consistent decision tree T and every node N such that N is in ext(T), 



A normal form operator norm is called subtree perfect if for every consistent deci- 
sion tree T and every node N which is in at least one element o/norm(T), 



In other words, for a subtree perfect operator, it does not matter whether we first 
restrict to a subtree and then optimize, or first optimize and only then restrict to 
the subtree: subtree perfectness means that optimization and restriction commute, 
as in Fig. |4] 

The extensive form operator ext p corresponding to the usual backward induction 
using expected utility is well known to be subtree perfect, provided probabilities 
at chance nodes are non-zero. Also, the usual normal form operator normp corre- 
sponding to maximizing expected utility over all normal form decisions is subtree 
perfect, because extp is equivalent to normp. 

Not all choice functions are subtree perfect: 

Example 12. Let T be the decision tree in Fig.\B[ where X , Y , and Z are its normal 
form gambles. Under point-wise dominance, X and Y are incomparable, as are Y 
and Z. Hence, norm(stAr(T)) is {X, Y} (where we conveniently identified normal 
form decisions with their normal form gambles). But norm(T) = opt({X, Y, Z}) = 



stjv(T) = {stjv(T) -.TeTandNin T}. 



st at (ext (T)) = ext(stjv(T)). 



stAr(norm(T)) 



norm(stjy(T')). 
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FIGURE 5. Decision tree for Example fl2l 

{Y, Z} as clearly Z dominates X. Restricting this solution to stjv(T) gives the 
normal form solution {Y}. Concluding, 

{X,Y} = norm(st A r(T)) ^ st^Cnorm^)) = {Y} 

and therefore the normal form operator induced by opt lacks subtree perfectness. 

5.2. Necessary and Sufficient Conditions, fn this section, we work extensively 
with normal form solutions, which are sets of trees. Therefore, we extend gamb, 0, 
and U, to sets of trees: 

Definition 13. For any set of decision trees T, gamb(7~) = lJ Tg7 -gamb(T). 

Definition 14. For any sets of consistent decision trees 7~i, ■ T n , and any 
partition E\, . . . , E n , let 

OILi EOi = {Qti E l T l : Ti e 71} , 

UUTi = {UUT i :T i eT i }. 

For sets of trees, the gamb operator keeps working as expected: 

(7) gamb {Q n l=1 EiTi) = ®ti & g&mb(%), 

(8) gamb(U n =1 77) = Ur=igamb(7;). 

One should also observe that gamb(T) = gamb(nfd(T)). 

The next three properties turn out to be necessary and sufficient for subtree 
perfectness of normal form operators induced by a choice function. 

Property 1 (Conditioning Property). Let A be a non-empty event, and let X 
be a non-empty finite A-consistent set of gambles, with {X, Y} C X such that 
AX = AY. IfXe opt(X\A), then Y e opt(X\A). 

This property is not found in the accounts of Hammond and McClennen, because 
their decision trees are different from ours. If their methods were adapted to our 
decision trees, then the conditioning property would appear in their work too. The 
property is more of a technical detail than an important point, since it states that, 
if two gambles are equal on an event, one gamble cannot be preferred to the other 
given that event. Such a property would be common and desirable for a choice 
function, regardless of its implications for subtree perfectness. 

Property 2 (Intersection property). For any event A ^ and non-empty finite 
A-consistent sets of gambles X and y such that y C X and oj>t(X\A) n y 0, 

opt(y\A) = opt(x\A)ny. 
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The intersection property is precisely Arrow's (C4) [2j p. 123]. 
For the next property, we use the following extension of the © notation: if A is 
a non-trivial event (non-empty and not SI), then 

AX © AZ = {AX ®AZ:XeX}. 

Property 3 (Mixture property). For any events A and B such that A n B ^ 
and A n B j£ 0, any A n B-consistent gamble Z, and any non-empty finite A n B- 
consistent set of gambles X , 

opt(AX ®AZ\B) = Aopt(X\An B) ®AZ. 

This property is a form of the well known independence principle (see for example 
[T71 p. 44]). It has strongest similarities to McClennen's independence for choice 
[T7l p. 57], and Arrow's conditional preference [IJ p. 257]. 

Property [2] has a many equivalent formulations. The following three give inter- 
esting alternative interpretations, and are useful in some of the proofs. 

Property 4 (Strong path independence). For any non-empty event A and any 
non-empty finite A-consistent sets of gambles X\, ...,X n , there is a non-empty 
1C {1, . . . , n} such that 



opt Q Xi 



A) = |J opt(Xi\A) 



iei 



Property 5 (Very strong path independence). For any non-empty event A and 
any non-empty finite A-consistent sets of gambles X\, ... ,X n , 



opt Q Xi 



A J = (J opt(^|A) 



i=l 

A , i no P t(u^ =1 A , ,|A)^0 



Property 6 (Total preorder). For every event A^%, there is a total preorder >za 
on A-consistent gambles such that for every non-empty finite set of A-consistent 
gambles X, 

opt(X\A) = {X e X : (VY 6 X)(X ±A Y)} 

The total preorder property essentially boils down to Arrow's (C5), which is also 
called the weak axiom of revealed preference [5J p. 123]. More equivalents are given 
by Arrow [2„ (CI)], Houthakker [7J p. 163], and Ville 25, p. 123]. 

Lemma 15. Properties\^^\5\ and\fi[ are equivalent. 

Proof. Property [5] =4> Property [5] Property H] ==> Property [2] Immediate. 
Property[l=^ PropertyE] See Arrow's (C4) <S=> (C5) p. 124, Thm. 1]. □ 

To show that Properties [lj [2] and |3] are necessary and sufficient for subtree 
perfectness of norm opt , we require several lemmas. The proofs are long but mostly 
tedious and straightforward, and so are omitted. 

For a decision tree T, ch(T) is the set of child nodes of the root node of T. 

Lemma 16. Lei norm be any normal form operator. LetT be a consistent decision 
tree. If, 

(i) for all nodes K £ ch(T) such that K is in at least one element o/norm(T), 
stif(norm(T)) = norm(stK(T)), 
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(ii) and, for all nodes K G ch(T), and all nodes L <E st k{T) such that L is in 
at least one element o/norm(stir(T)), 

stz (norm(st jc(T))) = norm(st L (st A '(T))), 

then, for all nodes N inT such that N is in at least one element of norm(T), 

stjv(norm(T)) = norm(stjv(T)). 

Lemma 17. Let A\, . . . , A n be a finite partition of CI, and let B be an event such 
that Aid B ^ for all i. Let X\, . . . , X n be a finite family of non-empty finite 
sets of gambles, where Xi is Ai n B -consistent. If a choice function opt satisfies 
Properties^ and[3J then 



(9) opt 



i=l 



B \ =0^opt(Ai|AnB). 

/ i=l 

Lemma 18. Consider a consistent decision tree T whose root is a decision node, 
so T = |J" =1 Ti, and any choice function opt. For each tree Ti, let be its root. 
Then, Ni is in at least one element of norm op t (T) if and only if 

(10) gamb(T 2 ) n opt(gamb(T)|ev(T)) ^ 0. 

Lemma 19. For any consistent decision tree T — Q"=i EiTi, and any choice 
function opt satisfying Property^ 

n 

(11) gamb(norm opt (T)) = @ E { gamb(norm opt (T i )) 

i=i 

implies 

n 

norm opt (T) = Q E { normopt (Ti). 
»=l 

Lemma 20. For any consistent decision tree T = |_|" =1 Ti and any choice function 
opt satisfying Property^ 

(12) gamb(norm opt (T)) = [J gamb(norm opt (T' i )) 
implies 



norm, 



opt 



(T)=nfd^|J normopt^, 



where 2= {i € {1, . . . , n} : gamb(Ti) n opt(gamb(T)|ev(T)) ^ 0}. 

The next lemma shows necessity of Properties [T][2] and[3]for subtree perfectness. 
Interestingly, the proof only involves the two decision trees in Figure El 

Lemma 21. 7/norm opt is subtree perfect, then opt satisfies Properties\Jl\^ and\^ 

We are now ready to establish that Properties [TJ [2j and [3] are also sufficient for 
subtree perfectness. 

Theorem 22 (Subtree perfectness theorem). A normal form operator norm opt is 
subtree perfect if and only if opt satisfies Properties^ [H anrf0 
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Figure 6. Decision trees for Lemma [5TJ 

Proof, "only if" . See Lemma [21] 

"if" . We proceed by structural induction on all possible arguments of norm opt , 
that is, on all consistent decision trees. In the base step, we prove the implication 
for trees consisting of only a single node. In the induction step, we prove that if 
the implication holds for the subtrees at every child of the root node, then the 
implication also holds for the whole tree. 

First, if the decision tree T has only a single node (a reward, and no further 
children) then subtree perfectness is trivially satisfied. 

Next, suppose that the consistent decision tree T has multiple nodes. Let 
{Ni,..., N n } = ch(T) be the children of the root node of T, and let T % = st Nz (T). 
The induction hypothesis is: subtree perfectness holds for all subtrees at every child 
of the root node, that is, for all Ti. More precisely, for all i £ {1, . . . , n}, and all 
nodes L £ 2} which are in at least one element of norm op t(Ti) 

st L (norm opt (ri)) = norm op t(st L (r j; )). 

We must show that 

stAr(norm opt (T)) = norm opt (stiv(T')) 

for all nodes N in T such that N is in at least one element of norm opt (T). By 
Lemma 1161 and the induction hypothesis, it suffices to prove the above equality 
only for N £ ch(T), that is, it suffices to show that 

(13) st Nt (normopt (T)) = norm opt (Tj) 

for each i £ {1, . . . ,n} such that Ni is in at least one element of norm opt (T). 

If T has a chance node as its root, that is, T — Q7—i E/Ti, then all Ni are 
actually in every element of norm opt (r), so we must simply establish Eq. (|13|) for 
all i £ {1, . . . , n}. Observe that, if we can establish 

n 

(14) norm opt (T) = Q E t norm opt (Ti), 

i=i 

then Eq. (JT3J) follows immediately. Indeed, by Eq. ([6]), 

gamb(norm opt (T)) = opt(gamb(T)|ev(T)) 

and by the definition of the gamb operator, Eq. (|4bj) in particular, 

= opt ( 0£ 4 gamb(T 2 ) 

\ i=l 
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and so by Lemma \T7\ 

n 

= E i opt(gamb(T l )|ev(T) n E t ) 
»=i 

so, since ev(T) n E i — ev(Tj), and again by Eq. ([6]), 

n 

= © ^ gamb(norm pt(Ti)) 
»=i 

Whence, Eq. (llH) follows by Lemma 

Finally, assume that T has a decision node as its root, that is, T — |_l"_i Ti- Let 
I be the subset of {1, . . . , n} such that i £ I if and only if iVj is in at least one 
element of norm opt (T). We must establish Eq. flT5|) for all i £ X. Equivalently, we 
must show that 



(15) 

Indeed, by Eq. 



norm opt (T) = nfd ^ |_J norm op t(?i)^ . 



gamb(norm opt (T)) = opt(gamb(T)|ev(r)) 
and by the definition of the gamb operator, Eq. (|4c[) in particular, 

= opt ^ygamb(Ti) ev(T)^J 

and so by Property [51 

= |J opt(gamb(T J )|ev(T)), 

iei* 

where I* = {i G {1, . . . , n} : gamb(Tj) n opt(gamb(T)|ev(T)) ^ 0}, and so because 
ev(T) = ev(Tj), and again by Eq. ©, 

= [J gamb(norm op t(T J )). 

iei* 

Hence, the conditions of Lemma [201 are satisfied, and I* = I by Lemma so 
Eq. HU is established. □ 

6. Backward Induction 

Although Selten's [23 definition of subgame perfectness does not explicitly refer 
to backward induction, one of its aims is unmistakably to identify when a game's 
equilibrium point can be found by backward induction. In fact, when maximizing 
expected utility, as in Selten's work, there seems little reason to distinguish be- 
tween subtree perfectness and backward induction because the optimal decisions 
are essentially unique: multiple optimal decisions will have the same expectation, 
so it suffices to consider only a single representative. 

However, if our concept of choice has no reference to expectation, multiple opti- 
mal decisions are not necessarily equivalent in any sense. Consequently, backward 
induction does not need to be tied to subtree perfectness. 
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Therefore, elsewhere j8], we suggested the following backward induction method, 
which generalizes classical backward induction to arbitrary choice functions, and 
which is useful when gamb(T) is very large and applying opt in one go is not 
feasible. We first extend norm op t to act on sets of decision trees. 

Definition 23. Given a choice function opt and any set T of consistent decision 
trees, where ev(T) = A for all T G T, 

norm opt (T) = {U 6 nfd(T): garxib(J7) C opt(gamb(T)|A)}. 

The goal of our backward induction algorithm is to reach a normal form solution 
of T by finding normal form solutions of subtrees of T, and using these solutions 
to remove some elements of gamb(T) before applying opt: 

Definition 24. The normal form operator back opt is defined for any consistent 
decision tree T through: 

• If T consists of only a leaf with reward r G 1Z, then 
(16a) backet (T) = {T}. 

• If T has a chance node as root, that is, T — 0»=i E{Fi, then 

(16b) back p t (q EiT^J = normopt |Q E, back opt (T;)^ 

• IfT has a decision node as root, that is, ifT — \_\™ =1 Ti, then 
(16c) back p t ^j_J T^j = norm opt |J backopt (Ti) . 

This algorithm is almost identical to that suggested by Kikuti et al. [10]: they 
apply opt only at decision nodes, whereas we apply opt at all type of node. 

It is instructive to compare the definition of the gamb operator (Definition [3]) 
with Definition 1241 The main difference is that back op t inserts norm op t at every 
stage, to remove as many normal form decisions as possible, early on. 

If back op t = norm opt , we can use the former as an efficient way of calculating 
the latter. Of course, this only works if a normal form gamble that is non-optimal 
in a subtree at a node cannot be part of an optimal gamble in the full tree. It is 
well known that choice functions exist for which this property does not hold: for 
examples, see LaValle and Wapman [11], Jaffray [9], and Seidenfeld [22]. In such 
cases, there exist trees such that back opt (T) ^ norm op t(T). 

The following four properties are necessary and sufficient for back op t to coincide 
with norm opt . 

Property 7 (Backward conditioning property). Let A and B be events such that 
A n B 7^ and Ad B ^ %, and let X be a non-empty finite A D B-consistent set of 
gambles, with {X, Y} C X such that AX = AY. Then X G opt(X\A n B) implies 
Y G opt(A"|j4 D B) whenever there is a non-empty finite A D B-consistent set of 
gambles Z such that, for at least one Z G Z, 

AX © ~AZ G opt(AX © AZ\B). 

This is clearly just a minor relaxation of Property [1] 
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Property 8 (Insensitivity of optimality to the omission of non-optimal elements). 

For any event A 7^ 0, and any non-empty finite A-consistent sets of gambles X and 

y, 

opt(X\A) cyc^^ opt(y\A) = opt(X\A). 

If opt satisfies this property, then removing non-optimal elements from a set does 
not affect whether or not each of the remaining elements is optimal. The property 
is called 'insensitivity to the omission of non-optimal elements' by De Cooman and 
Troffaes [4], and 'property e' by Sen [24] who attributes this designation to Douglas 
Blair. 

Property 9 (Preservation of non-optimality under the addition of elements). For 
any event A^§, and any non-empty finite A-consistent sets of gambles X and y, 

y c x => apt(y\A) 2 opt(x\A) n y. 

This is 'property a' in Sen [21], Axiom 7 in Luce and Raiffa [15, p. 288], and 
'independence of irrelevant alternatives' in Radner and Marschak [H)] . 

Property 10 (Backward mixture property). For any events A and B such that 
B n A ^ and B n A ^ 0, any B n A-consistent gamble Z , and any non-empty 
finite B n A-consistent set of gambles X , 

opt [AX © AZ\B) C A opt (A" I A n B) © AZ. 

This is an "inclusion-only" version of Property [3] We do not need the full 
property because backward induction moves from right to left. 

The proof of the following theorem is, up to obvious details, identical to the 
proof of Theorem [22] and is therefore left as an amusing exercise to the reader. 

Theorem 25 (Backward induction theorem). A normal form operator norm opt 
satisfies backward induction (that is, back opt (T) = norm opt (T) for every consistent 
decision tree T) if and only if opt satisfies Properties [7J [3] 01 and ] 101 

If back op t(T) = norm opt (T) for any consistent T, then, at decision nodes, back opt (T) 
coincides with the method of Kikuti et al. [TU] ■ 

7. Discussion 

7.1. Relationship Between Subtree Perfectness and Backward Induction. 

Obviously, Property[T]implies Property!?] and Property[3]implies PropertvlTTfl Also, 
it is easily shown that: 

Lemma 26. Property [J] implies Properties^ and\Q 

Corollary 27. 7/norm op t is subtree perfect, then norm op t = back op t. 

Subtree perfectness is, however, not necessary for backward induction. For ex- 
ample, point- wise dominance satisfies Properties Q] [S] andfTUl but as we saw in 
Example rT2] it lacks subtree perfectness. 

Backward induction does imply a weaker form of subtree perfectness. Suppose 
back op t (T) = norm op t(T) for all consistent decision trees. By definition of back opt , 
for any node N that is in at least one element of back opt (T) we have 

stAr (norm opt (T)) = stjv(back op t(T)) C back opt (stjv(T)) = norm op t(stjv(T)). 
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Why can this be seen as a type of subtree perfectness? A subgame perfect 
equilibrium point is one that induces an equilibrium point in all subgames. A 
subtree perfect normal form operator is one that induces its normal form solution 
in all subtrees. Theorem 1251 implies every optimal normal form decision induces an 
optimal normal form decision in any subtree. So although we do not have subtree 
perfectness of solutions we do have subtree perfectness of decisions. 

Definition 28. norm is subtree perfect for normal form decisions if for every N 
in T such that N is in at least one element o/norm(T), 

stjv(norm(T)) C norm(stiv(T)). 

Obviously, backward induction implies subtree perfectness for normal form de- 
cisions. But, the opposite implication does not hold: Property [5] is not necessary 
for subtree perfectness for normal form decisions. 

Theorem 29. For any choice function opt, norm opt is subtree perfect for normal 
form decisions if and only if opt satisfies Properties^\Q and \l(A 

7.2. Normal Form and Extensive Form Equivalence. Extensive form solu- 
tions can be seen as more natural solutions to decision trees, because they represent 
the sequential nature of the decision making, whereas the normal form solutions 
remove the sequential aspects. It is common to ask whether there is an equiva- 
lence between certain normal form operators and extensive form operators. For our 
definitions of normal and extensive form, this is very different from the of "nor- 
mal form/extensive form coincidence" of McClennen [17j p. 115]. In McClennen's 
terms, the normal form and extensive form refer to the structure of the tree, and 
coincidence requires the solution of the two forms to be the same. In the case of 
norm opt , this coincidence will always occur by definition. 

The clearest equivalence arises when norm(T) = {ext(T)}, that is, when the 
extensive form solution is itself a normal form decision, and is also the only element 
of the normal form solution. Even when using choice functions corresponding to 
total preorders, this may not arise for every decision tree, so a more general concept 
of equivalence is required. We shall consider an extensive form operator and a 
normal form operator to be equivalent if, for every consistent decision tree T, 

norm(T) = nfd(ext(T)). 

If such an equivalence holds, it is easy to move from the extensive form solution to 
the normal form solution, simply by finding all normal form decisions. It is also 
easy to move from the normal form to extensive form, as the following result shows. 

Lemma 30. Suppose norm and ext are equivalent. Then a node N is in ext(T) if 
and only if N is in at least one element of norm(T) . 

If we can find equivalent normal form and extensive form operators, then either 
both are subtree perfect, or neither is. 

Lemma 31. Suppose that ext and norm are equivalent. Then, norm is subtree 
perfect if and only if ext is subtree perfect. 

With these results at hand, normal-extensive form equivalence is easily estab- 
lished, using structural induction as usual (the proof is left to the reader): 
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Theorem 32. If a normal form operator norm op t induced by a choice function 
opt is subtree perfect, then there exists an equivalent subtree perfect extensive form 
operator ext. 

Perhaps surprisingly, there are subtree perfect normal form operators (necessarily 
not induced by a choice function) which have no equivalent extensive form operator. 
Indeed, the proof of Theorem [32] relies on the following consequence of Lemmas \T7\ 
and[H 

norm opt (Q" =1 EiT t ) = Q"=i E i norm opt (T i ), 

General subtree perfect normal form operators need not satisfy this. For instance, 
the example of Section [3. II can yield a subtree perfect normal form operator which 
has no equivalent extensive form representation. 

7.3. Related Work. As mentioned earlier, our results have strong links with the 
work of Hammond [6 , Machina [16] , and McClennen [17] . 

Machina [16] assumes probabilities at chance nodes and choice functions that 
correspond to total preorders. Under these assumptions, a necessary condition 
for subtree perfectness of norm opt is that opt satisfies separability over mutually 
exclusive events. Consider n possible events with probabilities p±, . . . , p„, a gamble 
giving reward r,- S Tt if event i obtains, and a second gamble that gives reward 
if event i occurs for i > 1 and r, G K for i = 1. Separability says that the second 
gamble is preferred to the first if and only if r* is preferred to ri . 

There are various ways to adapt separability for our more general setting. For 
example, consider a non-trivial event A and gambles X and Y such that AX = AY, 
AX = Ar\, and AY = Ar2 for some rewards r\ and r 2 £ 1Z. Separability could 
be: opt({X,Y}) = {X} if and only if opt({n,r 2 }) = {n}. Note that Property [3] 
implies this. Indeed, Property [3] can be seen as a strong form of separability, for it 
implies every reasonable generalization of separability. 

As already noted, Hammond's [5] results are slightly different from ours due 
to the definition of decision trees, and whether gambles involving probabilities are 
admitted. If such details are dealt with, then his results become very similar to 
ours. Using our terminology and notation, Hammond's first defines: 

Definition 33. An extensive form operator ext is consistent if it is subtree perfect. 

Definition 34. An extensive form operator ext is conscqucntialist if, for any de- 
cision trees T\ and T 2 such that gamb(Ti) = gamb(T 2 ) and ev(Ti) = ev(T 2 ) ; 

gamb(ext(Ti)) = gamb(ext(T 2 )). 

Definition 35. An extensive form operator ext is strongly consequentialist if it 
is consequentialist and, for any U € nfd(T) such that gamb([7) C gamb(ext(T)) 7 
U e nfd(ext(T)). 

Note that Hammond does not use the term "strongly consequentialist". He 
writes that consequentialism means that decisions should be valued by their gam- 
bles, which corresponds to strong consequentialism. Yet, his only mathematical 
definition of consequentialism seems to be identical to Definition [34] The difference 
is small, but important to link our results with his. 

Hammond argues that consistent and (not necessarily strongly) consequentialist 
extensive form operators induce a choice function on gambles as follows. 



21 



Definition 36. For a consistent and consequentialist extensive form operator ext, 
define its corresponding choice function opt ext by 

opt ext (X\A) = gamb(ext(T)), 

where T is any consistent decision tree with gamb(T) = X and ev(T) = A. Be- 
cause ext is consistent and consequentialist, this choice function exists and does not 
depend on the choice of T . 

With these definitions, we can prove a slightly stronger version of Hammond's 
results 6, Theorem 5.4, Theorem 6, Theorem 7, and Theorem 8]. 

Theorem 37. A choice function opt satisfies Properties\J\\^ and\^if and only if 
there is a consistent and strongly consequentialist extensive form operator ext such 
that opt ext = opt. 

Proof, "if". Follow the approach of Hammond [SJ Theorem 5.4 and Theorem 7]. 
Note that these proofs require only consequentialism. 

"only if" . Suppose opt satisfies Properties [TJ [3J and EH By Theorems [22] and EH 
norm opt is subtree perfect and has an equivalent subtree perfect extensive form 
operator ext. By definition of norm opt , for any strategically equivalent trees T\ 
and T 2 , gamb(norm opt (Ti)) = gamb(norm opt (T2)), and so the same holds for ext. 
Hence, ext is consistent and consequentialist. By construction, ext is also strongly 
consequentialist, and obviously also opt oxt = opt. □ 

It is easily seen that, for a particular choice function opt, there is exactly one 
consistent and strongly consequentialist extensive form operator that induces opt. 
Therefore, there is an equivalence between the consistent and strongly consequen- 
tialist extensive form operator inducing opt and the subtree perfect normal form 
operator induced by opt. This equivalence is not present in Hammond's account, 
since multiple consistent and (not strongly) consequentialist extensive form opera- 
tors can induce the same choice function. 

For example, if T has two normal form decisions inducing the same gamble X, 
nfd(exti(T)) could only include one, while nfd(ext2(T)) includes both, without vio- 
lating subtree perfectness. Moreover, gamb(T) contains X so strategic equivalence 
is preserved as required. Hence, both operators can be consistent and consequen- 
tialist, however only at most one of these can be equivalent to norm opt . Indeed, a 
consistent and consequentialist extensive form operator can only be equivalent to 
its corresponding norm opt if it is strongly consequentialist. 

As with Hammond, McClennen's [T7] decision trees differ in that some chance 
nodes can have probabilities for events. Also, there seems to be no concept of 
conditioning in McClennen's account. As noted in Section[3l McClennen's dynamic 
normal form solutions are more general types of normal form solutions. Some of 
his results [TTl Theorems 8.1 and 8.2] are similar to ours, and are based on three 
restrictions placed on his solutions. 

Definition 38 (McClennen, |171 p. 120]). A dynamic normal form solution satisfies 
dynamic consistency if for every node N in the tree, the restriction of the optimal 
set at the root node to N is exactly the optimal set at N . 

Obviously, there is a one-to-one correspondence between dynamic normal form 
solutions satisfying dynamic consistency, and our normal form solutions. 
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Definition 39 (McClcnnen, [17J p. 114]). A dynamic normal form solution satisfies 
plan reduction if for every normal form decision in T that induces the same gamble, 
either all or none of them are optimal. 

By definition, norm op t satisfies plan reduction. 

Definition 40 (McClennen. |171 p. 122]). A dynamic normal form solution satisfies 
separability if for any tree T and any node N in T, the set of optimal plans at N 
is the same as the set of optimal plans of the separate tree stjv(T). 

On its own, separability is not exactly subtree perfectness, but if dynamic con- 
sistency holds then the two properties are equivalent. McClennen's two theorems 
can then be adapted into our setting as: 

Theorem 41. If a dynamic normal form solution satisfies plan reduction, dynamic 
consistency, and separability, then then it coincides with norm opt for a choice func- 
tion opt satisfying Properties^ and [3 

Proof. The proof is essentially identical to that of Lemma [2TJ □ 

8. Conclusion 

We extended Selten's idea of subtree perfectness to decision trees, for normal 
and extensive for solutions. Subtree perfectness for extensive form solutions is 
Hammond's consistency condition. Subtree perfectness for normal form solutions 
is, under the assumption of dynamic consistency, McClennen's separability. We 
found necessary and sufficient conditions for a choice function to induce a subtree 
perfect normal form operator. These turned out to be similar to, but stronger than, 
those for backward induction to work. So, even if a normal form operator lacks 
subtree perfectness, it may still be possible to find the normal form solution by 
backward induction. 

While many choice functions satisfy Property [TJ Properties [5] and [3] are perhaps 
more restrictive than one would like. Is violating subtree perfectness acceptable? 
We believe that subtree perfectness is a desirable property and one must think 
carefully before abandoning it. On the other hand, if one is attracted to the three 
properties for other reasons, then subtree perfectness gives them a strong justifica- 
tion, particularly since (at least for Properties [2] and they are much more difficult 
to justify in a static setting. Attempts to justify violation of subtree perfectness, 
without violating dynamic consistency, have however been made, for example by 
Machina [16] and McClennen [171 9.6], 

We recovered the well-known fact that subtree perfectness requires total pre- 
ordering. Many choice functions suggested in the literature, such as maximality [2j)J 
Sec. 3.9] and E-admissibility [13], violate total preordering, and hence fail subtree 
perfectness. Interestingly, we can easily establish that some of these (particularly, 
maximality and E-admissibility) still admit backward induction. 

If one is committed to the idea of subtree perfectness but also wishes to use 
a choice function that fails Property [21 then the best solution may be to use an 
extensive form operator. One can easily define subtree perfect normal form solutions 
based on choice functions (but not directly induced as in Definition [6]) . These, 
however, can have unpleasant behaviour such as admitting pointwise dominated 
options. It is easier to avoid such behaviour with subtree perfect extensive form 
solutions, as proposed for instance by Seidenfeld [21] . although Seidenfeld's idea 
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will only work if the choice function can model complete ignorance. Roughly, this 
solution is found by backward induction, and when a decision node admits multiple 
optimal options, then it is treated as a chance node with complete ignorance about 
which of the optimal decisions is chosen. 

We have seen that, when using choice functions on gambles, subtree perfectness 
is closely related to equivalence between extensive form and normal form operators, 
both using our approach of defining an operator based on a choice function, and 
Hammond's approach of defining a choice function based on an operator. Interest- 
ingly, normal-extensive form equivalence need not hold for subtree perfect operators 
that are not induced by choice functions. 

Although we have primarily investigated norm op t in this paper, we do not argue 
that a normal form operator, and norm op t in particular, gives the best solution to 
a decision tree. A normal form solution requires a policy for all eventualities to 
be specified and adhered to. The subject adheres to this policy only by his own 
resolution: he may of course have the ability to change his policy upon reaching a 
decision node |21j . One could therefore argue that a normal form solution is only 
acceptable for sequential problems when the subject does not get the chance to 
change his mind (for example, if he instructs, in advance, others to carry out the 
actions). 

In practice, applying opt to the set of all normal form gambles may be difficult. 
Therefore, we defined a normal form operator which yields a normal form solution 
by means of backward induction, and which will, in many cases, be easier to apply 
than norm opt . We found necessary and sufficient conditions on opt for our backward 
induction algorithm to yield exactly norm opt . As mentioned, we found that total 
preordering is not necessary for backward induction to work: the set of choice 
functions that satisfy the backward induction properties are a subset of those that 
satisfy subtree perfectness for normal form solutions and a superset of those that 
satisfy subtree perfectness. Hence, it remains unclear whether backward induction 
has any justification beyond practicality. 
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Appendix A. Proofs 

A.l. Proof of Eq. ©. 

Lemma 42. For any decision tree T, gamb(T) = gamb(nfd(T)). 

Proof. We prove this by structural induction. In the base step, we prove the equality 
for trees comprising only one node. In the induction step, we prove that if the 
equality holds for the subtrees at every child of the root node of T, then the equality 
also holds for T. 

If T consists of only a single node, namely a reward node, then nfd(T) = {T} 
and the result holds trivially. Thus the base step is confirmed. 

Suppose T has a chance node at the root, that is, T = 0™ =1 E{Fi. Each element 
of nfd(T) is of the form 0™ =1 Ui, where t/j G rrfd(Tj). In other words, nfd(T) is the 
set of all possible mixtures of the elements of nfd(T,), that is, 

n 

nid{T) = QE l nfd{T i ). 

8=1 

The induction hypothesis is gamb(T , i ) = gamb(nfd(T i )) for each i. We have 
gamb(nfd(T)) = gamb ( nfd(T^ 

n 

= Ei gamb(nfd(T l )) 
i=i 

n 

= 0£ ig amb(Ti) 

i=l 

= gamb(T). 

On the other hand, if T has a decision node as a root, that is, T = |J" =1 Tj, then 
nfd(T) = Lu:Ue\J nfd^^j, 
and, since gamb(UC7) = gamb(J7) for any U, 

gamb(nfd(T)) = i gamb(Uf/): U € [j nfd(T,) i 



= |gamb(C7): U G Q nfd(T() 
= (J gambtnfd^)) 



i=l 



and again, the induction hypothesis says that gamb(Xi) = gamb(nfd(Ti)) for each 
i, so 

n 

= |J gamb(Ti) 
= gamb(T). 
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This completes the induction step. □ 
A.2. Equivalence of Definition |^A)] and (B) , 



Proof of equivalence. |(A)| => |(B)| We prove the implication by structural induc- 
tion on the tree T. In the base step, we prove that, for every X and non-empty 
event A, the implication holds for consistent decision trees which consist of only a 
single node. In the induction step, we prove that if, for all X and non-empty A, 
the implication holds for the subtrees at every child of the root node, then, for all 
X and non-empty A, the implication also holds for the whole tree. 

First, if T consists of only a single node, namely a reward node, then gamb(T) = 
{s} for some s € 1Z, so by assumption, X = gamb(T) = {X} where X is the gamble 
yielding a constant value s. Clearly, X _1 (r) = for all r ^ s and X^ 1 (s) = O, 
hence indeed X~ x (r) n A = A ^ whenever X _1 (r) ^ 0, and this for every 
non-empty event A. 

Next, suppose T has a chance node as its root, that is, T = 0? , £JjTj. By 
assumption, and by the definition of gamb (see Definition [3]) , 

n n 

X = gamb(T) = Ei gamb(T i ) = E t X t 

i=l i=l 

where Xi is a shorthand notation for gamb(Ti). Also, by assumption, ev(T) = A. 
By the induction hypothesis, we know that for every r 6 TZ and every Xi £ Xi such 
that Xr\ r ) ^ 0, it holds that X^ x {r) DAnEi ^ (indeed, ev(T,) = ev(T)DE t = 
A (~l Ei). Now we have all ingredients to prove the desired implication. Indeed, for 
any X s X, or equivalently, for any X = EiXi with Xi E X i7 

/ n \ _1 n 

(17) X-\r)= 0M (r) = |JXr 1 (r)n^ 

\i=l / i=l 

Hence Eq. (TlT)) implies that AT _1 (r) ^ whenever Xf (r) CiEi ^ for at least one 
i 6 {1, . . . , n}, and in that case, it obviously follows that also X~ 1 (r) ^ 0, which 
implies, as just shown, that AT i _1 (r) Ci An Ei ^ 0. But, then, again by Eq. dT7|) . it 
must also hold that X^r) n A ^ 0. 

Finally, suppose that the root of T is a decision node, that is, T — |_l™=i By 
assumption, and by the definition of gamb (see Definition [3]) , 

n n 

X = gamb(T) = (J gamb(T<) = |J 
t=i i=i 

where ^ is a shorthand notation for gamb(T^). Also, by assumption, ev(T) = A. 
By the induction hypothesis, we know that for every r £ TZ and every Xi £ Xi such 
that Xr\ r ) ^ 0, it holds that X^{r)C\A ^ (indeed, ev(T;) = ev(T) = A). But, 
for any X £ X, it follows that X = Xi for some Xi £ Xi, so the desired implication 
follows immediately from the induction hypothesis. 

(B) => |(A)| Suppose that for every r € 1Z and every X 6 X such that AT~ 1 (r) 



it holds that X 1 (r) n A ^ 0. Consider the decision tree 



X£X rGlZ 
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with ev(T) = A. Let N(X) denote the chance node of T associated with X, and let 
N(X, r) denote the reward node of T associated with X and r (of course N(X, r) 
only exists for A _1 (r) ^ 0, by definition of T). 

Clearly, T is consistent, because ev(stjv(js:) (T)) = ev(T) = A ^ and ev(st7v(x,r) (^)) 
ev(st A r ( x)(7 1 )) n A" _1 (r) = in X -1 (r) ^ by assumption, and 



/ 



gamb(T) = gamb 



xex 



\ 



rETZ 



J 



= U X-Hr)r\= [j{X} = X 



X£X | r£R 

.X^(r)^ 



which establishes |(A)| 

A. 3. Proof of Lemma 1151 



xex 



□ 



Proof. In this proof, A is a non-empty event and all gambles are A-consistent. 
Property [2]=^ Property [5] Let Xi, ... ,X n be non-empty finite sets of gambles, 



and let X = |J" =1 X,. If opt(A?|A) n Af fc ^ 
Hence, 



then opt(Afe|^) = opt(X\A) n A fe 



opt(#|A) = Q opt(^|A) n ^ = Q opt(A" fc |A) 



fc=i 



fc=i 

A' fe nopt(A'|A)#0 



Property [5] => Property |H Immediate. 

Property 12=^ Property HI Define X Y if X G opt({A, Y}|A). First, we 
prove that is a total preorder (i.e. total, reflexive, and transitive). Clearly, 
hA is total since X G opt({A,Y}|A) or Y G opt({X, Y}|A), hence I F or 
Y >^yi A, for all normal form decisions X and Y. Obviously, >A is reflexive. Is 
transitive? Suppose A Y and Y Z. 

By Property HI 

(opt({Y,Z}\A), or 
oj>t({X,Y, Z}\A) = < {A}, or 
[opt({Y,Z}| J 4)U{A}. 

Since, Y ^ A it follows that {A, Y} n opt({A, Y, Z}|A) ^ 0. 
Again, by Property HI 

fopt({A,Y}|A), or 
opt({A,Y,Z}|A)= ^ {Z}, or 
[opt({A,Y}|A)U{Z}. 

The case opt({A, Y, Z}\A) = {Z} cannot occur however, because we just showed 
that {A, Y}nopt({A, Y,Z}\A) 7^ 0. Hence, because A G opt({A, Y}\A), it follows 
that A G opt({A,Y,Z}|A). 
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Once more by Property IH 

(opt({X, Z}\A), or 
opt({X,Y,Z}\A) = I {Y}, or 
[opt({X,Z}\A)U{Y}. 

We just showed that X G opt({X, Y, Z}\A), hence the second case cannot occur, 
and it can only be that also X G opt({X, Z}|A), establishing X ^a Z. 
Finally, we prove that 

opt(X\A) = {X e X: (VY G X){X hA Y)}, 

or equivalently, we prove for any X G X that X G opt(A"|A) if and only if X G 
opt({X,Y}\A) for all Y e X. 

Indeed, by Property HI for any X and Y in X, it holds that 

fopt({X,y}|A), or 
opt(X\A) = I opt(X \ {X,Y}\A), or 
[opt({X, Y}\A) U opt(* \ {X, Y}\A). 

and hence, if X G opt(A , |^4) then the second option is impossible and therefore 
X G opt({X,Y}\A) for all Yey. 
Conversely, again by Property |4] 



jpt(Af|j4) = opt ( |J {X,Y} 
\Yex 



a\=\J o P t({x,y}|A) 

/ Yey 



for some subset y of X, and hence, if X G opt({X, Y}\A) for all Y £ X, then 
X G opt(X\A). 

Property [6] =^ Property H Assume that opt (A" | A) n y ^ 0. This means 
that there must be an Y* G y such that Y* >a X for all X G <Y. Clearly, 
y* G opt{y\A). But, for all Y G opt(^|A) it must also hold that Y > A Y*, and 
hence Y >a X for all X G X as ^ is transitive. We conclude: 

opt(^|A) = {Y G y : (VX G X)} = opt(A , |^l) n y 

□ 

A. 4. Proof of Lemma 1 161 

Proof. If N is the root of T, then the statement is trivial. If N G ch(T), then 



the statement follows from (i) Otherwise, N must belong to st k(T) for some 
K G ch(T). 

First, note that K is a node of at least one element of norm(T). Indeed, it is 
given that TV is a node of at least one element, say U, of norm(T). Then, obviously, 
K must also be a node of U, simply because any node on the unique path within 
T between the root of T and N must be a node of U, and one of those nodes is K. 
So, if is a node of an element of norm(T) (namely, U). 

Secondly, note that N is also a node of at least one element of norm(st#-(T)). 
Indeed, TV is a node of an element of norm(T), and hence, in particular also of 



stif (norm(T)). But, by (i) and the fact that TV is a node of at least one element of 
norm(T) (as just proven), it follows that stjf (norm(T)) = norm(stj<-(T')). Hence, 
N is also a node of at least one element of norm(stif (T)). 



2!) 



Combining everything, it follows that 

norm(stAr(T)) = norm(stjv(st.K-(T))) 

so, by |(ii)| and because N is in at least one element of norm(st/f (T)), 

= stjv(norm(st k{T))) 



hence, by (i) and since K is in at least one element of norm(T), 

= stjv(st#-(norm(T))) = stAr(norm(T)). 

A. 5. Proof of Lemma 1171 



□ 



Proof. The statement is trivial if n — 1 (because, in that case, A\ = f2). Let us 
prove the statement also in case n > 2. 

Let X — Qh—i AiXi. Consider any k G {1, . . . , n} and let 

where (Aj )j 1 tk forms an arbitrary partition of f2 such that Ak nA'j = Aj for all j ^ k. 
Clearly, Zk is A k H B-consistent because we can trivially find a consistent decision 
tree T with ev(T) = Ak DB and gamb(T) = Z kl using the Aj n B-consistency (and 
hence, Ak fl Aj n ^-consistency) of each Xj for j ^ fc. 
Now, observe that by construction of Zk, 

X = A k X k ®A k Z k = (J (A k X k ®A k Z k ). 

Note that X is inconsistent (indeed, because each Xi is Aj fl -B-consistent, we can 
trivially find a consistent decision tree T with ev(T) = £> and gamb(T) = A?). 

Since Property [2] holds, Property |4] holds as well by Lemma fT5l So, if we apply 
opt(-|£?) on both sides of the above equality, then it follows from Property |4] that 

opt(*|£)= |J opt(A k X k ®A k Z k \B), 

z k ez* h 

for some Z%. CZj. By Property [31 

= (J {A k opt{Xk\A k riB)(BAkZ) 

z k ez; 

(18) =A fc opt(^ fc |A fc nB)e3 fc ZjJ 

Since this holds for each k € {1, . . . , n}, we arrive at Eq. ([9]), by Lemma l43l □ 

We used the following lemma. 

Lemma 43. Let A\, . . . , A n be a finite partition oftt, n > 2. Let X , and X\, Z\, 
. . . , X n , Z n be non-empty finite sets of gambles. If 

X = A k X k ®A k Zk 

for all k S {1, . . . , n}, then 

n 

X = AiXi. 
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Proof. Let us prove the implication by induction. 

The implication holds for n = 2. Indeed, suppose that 

X = A x Xx © A 1 Z 1 = A 2 X 2 © ~A 2 Z 2 . 

By multiplying all elements of these sets with A 2 , and noting that A\ — A 2 and 
A 2 = Ai, we find that 

A1Z1 = A 2 X 2 

which establishes the base step. 

For the induction step, assume that the implication holds for n = m. We prove 
that the implication also holds for n = m + 1 . Suppose that 

X = A\X\ ®A\Z X 
X = A 2 X 2 ®~A 2 Z 2 

X = A m+ \X m+ \ © A m+1 Z m+ i 

First, note that the equality AiXi © A\Z\ = A 2 X 2 © ^2-22 implies in particular 
that (by multiplying all elements of these sets with A{) 

~A\Z X = A 2 X 2 © (At n A 2 )Z 2 

Hence, 

X = A\X\ ®~A X Z X = A\X\ © ~A~i{A 2 X 2 © (At n A 2 )Z 2 ) 
= A\X\ © A 2 X 2 © (Ai n A 2 )Z 2 
= {A 1 U A 2 )(A 1 X 1 © ^2^2) © (Ai n A 2 )Z 2 

With A\ = Ax U A 2 , X* = A X X X © A 2 X 2 , and Z{ = Z 2 , we now have that 

X = A\X* © ~A\Z* X 
X = A 3 X 3 (SA 3 Z 3 

X = A m +iX m +i © A m +iZ m -]-i 

By assumption, the implication to be proven already holds for n 
above equalities, it follows that 

X = A\X* © A 3 X 3 © ... © A m+1 X m+1 
and, by construction of A\ and X{ , 

m+l 
i=l 

which proves the induction step. 



= m, so, from the 



□ 
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A. 6. Proof of Lemma 1181 

Proof. By Eq. ©, Eq. (jTOj) holds if and only if 

gamb(nfd(T J )) PI opt (gamb (T)|ev(T)) ^ 0, 

or equivalently, if and only if there is a normal form decision U G nfd(Ti) such that 
gamb(J7) C opt(gamb(T)|ev(T)) (remember that gamb(£7) is a singleton). 

But, by definition of the gamb operator, it also holds that gamb(£7) = gamb(l_l£/). 
Hence, Eq. (fT0|) holds if and only if there is a normal form decision U G nfd(Tj) 
such that gamb(U£7) C opt(gamb(T)|ev(T)), or equivalently, if and only if there 
is a normal form decision V G nfd(T) which contains the node Ni, such that 
gamb(V) C opt(gamb(T)|ev(T)). 

By definition of norm opt , this is equivalent to stating that Eq. (|10j) holds if and 
only if Ni is in at least one element of norm opt (T) . □ 

A. 7. Proof of Lemma 1191 

Proof. Assume that Eq. (fTTj) holds. 

First, consider a normal form decision U G Oi=i norm opt {Ti ) . Obviously, 

gamb(Z7) C gamb ^ Q E, norm opt (T i )^ 
so, by the definition of gamb, Eq. (|7|) in particular, 

a 

= ® Ei gamb(norm opt (r i )) 

i=l 

and hence, by Eq. (jTTJ) , 

= gamb(norm opt (T)). 

So, there exists a normal form decision V G norm opt (T) such that gamb(V^) = 
gamb([7). Since U G nfd(T), by definition of norm opt we have U G norm opt (T). So 
we have shown that 

n 

norm opt (T) D E, norm opt (Tj). 

i=i 

Next, consider a normal form decision t/ G norm opt (T). We know by Eq. 
that 

n 

gamb(C/) C Ej gamb(norm opt (Tj)). 

i=l 

We can write U = 0™ =1 EjC/j, where E7j G nfd(Tj), so 

n n 

Ei gamb([/ l ) C ^ gamb(norm opt (T i )). 

i=l i=l 

Consider any k and any normal form decision Vk G norm opt (Tfc ) . The above 
equation, and Eq. ©, tell us that we can choose each Vk such that Ek gamb(Vfe) = 
Efcgamb(C/fe). Of course, because Vk G norm opt (Tfe), 

gamb(y fc ) C opt(gamb(T fe )|ev(T fc )). 

We wish to establish that also gamb(£4) C opt (gamb (Tfc)|ev(T&)). 
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Indeed, this follows from Property [TJ observe that both singletons gamb(Uk) and 
gamb(Vfe) are subsets of gamb(Tfe), Ek gamb(£//c) = Ek gamb(V/c), and gamb(Vfe) C 
opt(gamb(Tfe)|ev(Tfe)). Consistency of T confirms that gamb(Tfc) is ev(Tfc)-consistent 
Hence, Property [T] applies, and 

gamb(£4) C opt(gamb(T fc )|ev(T fc )). 

Therefore, Uk £ norm opt (Tfc) by definition of norm opt (Ti). Since this holds for 
any k, we conclude that U £ 0™=i norm pt(Ii). So, we have shown that also 

n 

norm opt (r) C (T) Ei norm pt(Tj). 
i=i 

□ 

A.8. Proof of Lemma l20l 

Proof. Assume that Eq. ([TJ]) holds. 

Consider any normal form decision V £ nfd norIn opt(^i)) ■ By definition 

of gamb, 

gamb(V) C (J gamb(norm op t(T t )) 

iex 

and, by Eq. flT2J), 

= gamb(norm opt (T)). 

Hence, by definition of norm opt , and the obvious fact that V £ nfd(T), it follows 
that V £ norm pt (T) . So we have shown that 

norm pt (T) 2 nfd ^ |_J norm opt (r i )^ . 

Conversely, let V £ norm opt (T). Then, again by Eq. (|12l) . 

gamb(F) C gamb(norm opt (T)) = gamb(norm opt (T i )). 

iex 

Now V = UU where U £ nfd(Tj) for some i £ X. We want to show that U £ 
norm op t(T l ). 

Indeed, let X be the gamble corresponding to V, and also U, 

gamb(V) = gamb(LT) = {X}. 

Because V £ norm opt (T), we know that X £ opt(gamb(T)|ev(T)). It is established 
that U £ norm opt (Ti) if we can show that X £ opt(gamb(Tj)|ev(Tj)). But this 
follows at once from Property^ because i £ X (and the definition of X), gamb(Ti) C 
gamb(T), ev(T) = ev(Tj), and all sets of gambles are consistent with respect to the 
relevant events: 

opt(gamb(T I )|ev(T J )) - opt(gamb(T)|ev(T)) n gamb(T 4 ). 

Concluding, also 



norm, 
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A.9. Proof of Lemma HH 

Proof. Assume that norm op t is subtree perfect. 

We first establish Property [TJ Let A be a non-empty event, and let A be a non- 
empty finite set of A-consistent gambles such that {X, Y} C A with AX = AY 
and X G opt(X\A). We show that Y G opt (X \ A). If A — Q the result is trivial, so 
assume A c ft. 

Consider a consistent decision tree T = ATi&AT 2 , where ev(T) = ft, gamb(Ti) = 
X, and Ti is a normal form decision with gamb(T2) = {Z}, an A-consistent gamble 
(see the left tree in Fig. [6]). We know by consistency of the gambles that there is 
such a T (see Definition©. 

Consider U G nfd(Ti) with gamb([7) = {X} and V G nfd(Ti) with gamb(V0 = 
{Y}. By definition of norm opt , we have U G norm op t(Ti). Therefore by subtree 
perfectness, AUQAT 2 G normo P t(T) and of course AV&AT 2 G norm opt (T). Again 
by subtree perfectness, V G norm opt (T i ), whence Y G opt(A|j4). 

Next, we establish Property [2] Let A be a non-empty event, and let y C X 
be non-empty finite A-consistent sets of gambles such that opt (A" | A) fl y ^ 0. We 
show that opt(^|A) = opt{X\A) n ^. 

Let T = Ti U T2 be a consistent decision tree with ev(T) = ^4, gamb(Ti) = y 
and gamb(T2) = A (see the right tree in Fig. [6]). We know by consistency of the 
gambles that there is such a T. 

Let N be the decision node at the root of T\. By subtree perfectness, we have 

gamb(st at (norm opt (T))) = gamb(norm opt (st at(T))). 
The right-hand side is equal to opt(3 ; |A). Also, 

gamb(stjv (normopt (T))) = gamb(norm opt (T)) n gamb(stjv(T)) 

= opt(A|A) ny 

as required. 

Finally, we establish Property |3] Let A and B be events such that A n B ^ 
and A n B 7^ 0, let A be a non-empty finite A n i?-consistent set of gambles, and 
let Z be a A n B-consistent gamble. 

Let T = AT1QAT2 be a consistent decision tree such that ev(T) = B, gamb(Ti) = 
A", and gamb(T 2 ) = {Z} (see the left tree in Fig.©. 

By subtree perfectness, we have (letting N be the root node of Ti) 

gamb(st N (norm op t (T))) = gamb(norm opt (stAr(T))). 
Here, the right-hand side is opt(A|yl fl B), and 

gamb(st A r(norm opt (T))) = {X G X : AX © ~AZ G opt(AY © AZ|B), 
whence Property [3] follows. □ 

A. 10. Proof of Theorem 1251 Before we prove Theorem [25l we prove a few lem- 
mas, provide two counterexamples, and introduce a new property called path inde- 
pendence. 

Lemma 44 (Sen [24|, Proposition 17]). A choice function opt satisfies Property^ 
if and only if, for any non-empty event A and any finite family of non-empty finite 
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A-consistent sets of gambles X\, ... X n , 



opt \j Xi 



A) C opt \Jopt(Xi\A) 



A) C |Jopt(^|A). 



Property 11. j4 choice function opt is said to oe path independent i/, /or any 
non-empty event A, and for any finite family of non-empty finite A-consistent sets 
of gambles X\, . . . , X n , 



opt (J Xi 



A] =opt \Jopt(X t \A) 



.4 



Path independence appears frequently in the social choice literature. Plott [18] 
gives a detailed investigation of path independence and its possible justifications. 
Path independence is also equivalent to Axiom 7' of Luce and Raiffa [TS1 p. 289]. 

Lemma 45 (Sen [241 Proposition 19]). A choice function opt satisfies Properties^ 
and\^if and only if opt satisfies Provertu\ll\ 

Properties 13 [9] and QT] are expressed slightly differently here than in Sen [24] , 
who does not use the concepts of conditioning and consistency. Despite this, the 
proofs of Lemmas 03] and 05] proceed identically to the corresponding propositions 
by Sen. Also, Sen defines path independence only for pairs of subsets, but Plott [TSJ 
Theorem 1, p. 1082] shows that this type of path independence is indeed equivalent 
to Property [TT] 

Lemma 46. Let A\, .... A n be a finite partition of CI, n > 2. Let X, and X\, Z\, 
. . . , X n , Z n be finite sets of gambles. If 



X C A k X k © A k Z k 



for all k G {1, . . . , n}, then 



XC@A k X k . 



k=l 



Proof. Let X £ X, then X £ A k X k © A k Z k for all k. Therefore, for each k, there 
is an X k S X k such that A k X — A k X k . Whence, 



k=l 



k=l 



□ 



Lemma 47. Let A\ 7 . . . , A n be a finite partition of fl. Let B be any event such 
that Ai (~1 B ^ for all A^. Let X\ 7 . . . , X n be a finite family of non-empty finite 
sets of gambles where each X, L is Ai HB -consistent. If a choice function opt satisfies 
ProvertiesUft and \l(A then 



(19) 



opt ( @AiXi 



i=l 



B j cQAiopt(Xi\AinB). 



Proof. Let X = AiXi. Consider any k E {1, . . . , n} and let 

Zk = ^A' j X j 

3^k 
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where (A'j)jjtk forms an arbitrary partition of fi such that A^nA'^ = Aj for all j ^ k. 
Clearly, Zk is A& Pi -B-consistent because we can trivially find a consistent decision 
tree T with ev(T) = Ak OB and gamb(T) = Zk, using the Aj n -B-consistency (and 
hence, Ak H A^- n -B-consistency) of each for j ^ fc. 
Now, observe that by construction of Zk, 



X = A h Xk © A k Zk 



(J (A fe A fe ©A fe .z fe 



Note that is -B-consistent (indeed, because each Xi is Aj (~l -B-consistent, we can 
trivially find a consistent decision tree T with ev(T) = -B and gamb(T) = A"). 

If we apply opt(-|B) on both sides of the above equality, then it follows from 
Lemma 23] that 

opt(A|£) C (J opt(A fc Af fc © A fc Z fe |B) 

and by Property [TU] (once noted that Xk is A& Pi -B-consistent by assumption, and 
Zk is Ak fl -B-consistent by construction) , 

C (J (A fc opt(<Y fc |A fc nB)©A fc Z fe ) 
= A k opt(A? fc |A fe flB)® A fc Z fe 

whence by Lemma l46l 

71 

opt(A|B) c0A iO pt(^|^ns). 

i=l 

□ 

Lemma 48. Let A\, . . . , A n be a finite partition of Q. Let B be any event such 
that AiCiB ^ for all Aj. Let X\, . . . , X n be a finite family of non-empty finite sets 
of gambles, where each X t is Aj fl B-consistent. If a choice function opt satisfies 
Properties [5J and I j 01 £/ien 



(20) 



opt 0A,^ 



B J = opt Ai opt(A' l |A l n B) 



B 



Proof. By Lemma l47l and the definition of opt, 



opt ( 0A.A-, 



B j cQAiopt^lAinB) C04A 



whence Eq. (|20p follows by Property HJ 



Lemma 49. For any consistent decision tree T = 0™=i ^i^i an d an V choice f 
tion opt satisfying Property 

(21) gamb(norm opt (T)) = gamb ^ norm opt ^ O norm opt( r i)^ ^ 

norm opt (T) = norm opt Q Bj norm opt (Tj) . 



□ 

UT1C- 
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Proof. In this proof we require Eq. ((6]), and the following consequence of Eq. 
(22) gamb ^ norm opt ( Q Ei norm opt (Ti)^ ^ 

= opt ( opt(gamb(r i )|ev(T) n 



i=l 



ev(T) 



We first show that 

norm opt (r) 2 norm opt (•) norm opt (7* ' 



Consider a normal form decision J7 G norm op t(Q™ = i norm op t(Xi)). We have, 
by Eq. (El]), 



gamb(C/) C gamb ^norm opt f Q-^i normopt (21; ) J J = gamb(norm opt (T)). 

So, there exists a normal form decision V G norm opt (T) such that gamb(V^) — 
gamb([/). Since U G nfd(T), by definition of norm opt , U G norm opt (T), which 
establishes the claim. 
Next, we show that 

norm opt (T) C norm opt ^ Q Ei norm opt (Ti)^ . 

Consider a normal form decision U G norm opt (T). We know by Eq. (|21[) and 
Eq. (|22l) that 

gamb(C/) C opt ^ ^ opt(gamb(T 4 )|ev(T) PI £;) ev(T)^J . 
We can write U = Q" =1 E,U l , where [/, G nfd(T,), so by Eq. gb|, 

n / n \ 

Ei gamb(C/i) _ opt ^ opt(gamb(T J )|ev(T) n £;) ev(T) . 

i=l \ i=l / 

Consider normal form decisions Vi G norm opt (Ti). The above equation, and Eq. ([5]) 
tell us that, for each i, we can find Vi such that Eig&mb(Vi) — Eiga,mb(Ui). Of 
course, because Vi G norm opt (Ti), 

gamb(^) C opt(gamb(T l )|^ n ev(T)). 

We further have, for V = Q" =1 E,V, gamb(F) = gamb(£T) and V G nfd(T), and 
so V £ normopt (T). 

If we can establish that 

(23) gamb([/,) C opt (gamb (T*)^ n ev(T)), 

then, by definition of norm opt and because Ui G nfd(Ti), it follows that Ui G 
norm opt (T i ). So, in that case, there is a V G norm opt (©" =1 norm opt (Tj)) such 
that gamb(V^) = gamb([7), and U G nfd(©" =1 Ei norm opt (T i )). Therefore by def- 
inition of norm opt , we will have U G norm opt (Q™ = i Ei norm opt (Ti)), establishing 
the desired result. 
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We show that Eq. (|23|) indeed holds by Property When n = 1 the re- 
sult is trivial, so assume n > 2. Observe that both singletons gamb(t/;) and 
gamb(l^) are subsets of gamb(Xi), Ei gamb([/i) = E.- L gamb(T^), and gamb(Vi) C 
opt(gamb(T;)|E n ev(T)). Further, gamb(T l ) is Ei n ev(T)-consistent. We are 
almost ready to apply Property [7] 

We know that 



gamb(V) = 0£jgamb(V r i) C opt(gamb(T)|ev(T)) 

ev(T) 



2 = 1 



= opt 0E i gamb(T i ) 

\ i=l 

Letting 

Z={E X \A E 2 ) gamb(V 2 ) © E 3 gamb(V 3 ) © . . . © E„ gamb(K) 

and 

2 = (BiU E 2 ) gamb(T 2 ) © E 3 gamb(T 3 ) ®...®E n gamb(T„), 
we see that Z E Z and -E is J5i Pi ev(T)-consistent. Further, 

n 

E 1 gamb(Fi) 8 ExZ = E i gamb(^) = gamb(F) 

i=l 

and 

n 

E x gamb(T 1 ) © E X Z = Ej gamb(Ti). 

i=l 

We see that 

Ei gamb(V) © EiZ C opt(Ei gamb(Ti) © EiZ|ev(T)). 

Hence we have found a Z and a Z £ Z required to apply Property [7J Finally, by 
Ei gamb(J7i) = E\ gamb(Vi), and Property!?! we have 

gamb(J7i) C opt(gamb(Ti)|Ei n ev(T)). 

This argument applies for any index i and therefore Eq. (|23[) has been shown, 
establishing the result. □ 

Lemma 50. For any consistent decision tree T = |_|" =1 T%, and any choice function 
opt satisfying Property^ 



(24) gamb(norm opt (T)) = gamb y norm opt y |_J norm opt (Ti) J J 

implies 

norm opt (T) = norm opt ^ |_J norm opt (T.j)J . 

Proof. In this proof we require of Eq. ©. For clarity, let A = ev(T) = ev(Tj). We 
first show that 

norm opt (T) D norm opt ^ |_J norm opt (Ti)J . 



38 



NATHAN HUNTLEY AND MATTHIAS C. M. TROFFAES 



Consider a normal form decision U G norm op t(|_|" =1 normopt (Ti)). To show that U G 
normopt(T'), we must show that U G nfd(T) and gamb([7) C gamb(norm op t(T))- 
The former is obvious, and the latter is established by Eq. 

gamb(J7) C gamb ^norm opt ( |_J norm opt (T J :)^ j = gamb(norm opt (T)). 

Next we show that 

norm opt (T) C norm opt ( |_J norm opt (Ti)^ . 

Let U G norm op t(T). To show that U G norixioptdJ™^ normopt (I*)) we must show 
that U G nfd(|J" =1 norm op t(Ti)) and that 



gamb(t7) C gamb ( norm opt I | | norm, 



opt 



, i=l 




The latter requirement follows immediately from Eq. (|24p : 

gamb(C/) C gamb(norm opt (T)) = gamb norm opt |_J norm opt (T' J ) 



=i 



We now prove that U G nfd(|J™ =1 norm opt (Ti)). Let V be U with the root 
node removed, that is, U = UV . Clearly, for some k, V G nfd(Jfc). It suffices 
to show that V G norm opt ) . Let {X} — gamb(£7) = gamb(F). We know 
that X G opt (gamb (T) | A), and also that X G gamb(Tfc). If we can prove that 
X G gamb(norm opt (Tfe)) = opt(gamb(Tfc)|A), then V G norm opt (Tfe). Indeed, 
using Property [5] and gamb(Tfc) C gamb(T) we have 

opt(gamb(T fc )|A) D gamb(T fc ) n opt(garnb(T)|A). 

So we have shown that indeed X G opt(gamb(Tfc)|A), establishing the claim. □ 

Lemma 51. // back op t (T) = norm opt (T) for any consistent decision tree T, then 
opt satisfies Property^ 

Proof. Let A and B be non-empty events, and X, Z be non-empty finite sets of 
gambles, such that the following properties hold: A n £? 7^ 0, A n £> 7^ 0, X 
is A n B-consistent, Z is A n B-consistent, and there are X, Y G X such that 
AX = AY, X G opt(X\A n B), and AX © ~AZ G opt(AX © AZ\B) for at least 
one Z G Z. If it is not possible to construct such a situation, then opt satisfies 
Property [7] automatically. Otherwise, to prove that Property[7]holds, we must show 
that Y G opt(X\Ar\B). 

Consider a consistent decision tree T = AT1QAT2, where ev(T) = B, gamb(Xi) = 
X, and gamb(T2) = Z. Since X is A P\ inconsistent and Z is A n inconsistent, 
we know from Definition [9] that there is such a T. We have gamb(norm opt (T)) = 
opt (gamb (T)!^) = opt (AX © AZ\B). So, AX © ~AZ G gamb(norm opt (T)), and of 
course AX © ~AZ = AY~ © AZ. 

Therefore, any normal form decision in nfd(T) that induces the gamble AY ®AZ 
must be in norm opt (T). In particular, by Lemma BS1 there is a normal form decision 
U G nfd(Ti) such that gamb(C/) = {Y}, and a normal form decision V G nfd(Ta) 
such that gamb(F) = {Z}. So_AU QAV G nfd(T) and gamb (At/ 2V) = {AY © 
AZ}. Indeed, because AX © AZ G opt(AA? © AZ\B), it follows that AJ7 AV G 



39 



norm opt (T) = back opt (T). By definition, back opt (T) = norm opt ( A back opt (Ti ) 
Aback opt (T 2 )), and so it must hold that U 6 back opt (Ti) = norm opt (Ti). Whence, 
gamb(lf) C gamb(norm opt (Tj)) = opt(A'|A n B). Since gamb([/) = {Y}, we have 
Y e opt(X\A n B), establishing Property!?] □ 

Lemma 52. //gamb(back op t(T)) = gamb(norm opt (T)) for any consistent decision 
tree T , then opt satisfies Property I Hi 

Proof. Let A be a non-empty event, and X±, . . . , X n be non-empty finite sets of A- 
consistent gambles. Let T = |_|™ =1 Tj be a consistent decision tree, with gamb(Tj) = 
Xi for each i, and where ev(T) = A. The existence of T is assured by A-consistency 
of X (see Definition We have 



opt |J A- 



A J = gamb(norm opt (T)) = gamb(back opt (T)) 
= gamb norm opt |_J back op t(Ti) J J 



From Eq. ([S]), we have gamb(norm opt (T)) = opt(gamb(T)|A). Similarly, with re- 
peated applications of Eq. ([6]) and Eq. ((5J , 



opt 



opt 



gamb ( |_J back op t (Ti)) 

n 

[J gamb(back op t(T,;)) 



opt I (J gamb(r 



lopt 



opt |Jopt(gamb(T i )|A) 



Finally, we note that gamb(T) = U"=i %i an d gamb(Ti) = Xi. Therefore, 




opt Qopt(Ai|A) 



.4 



□ 



Lemma 53. //gamb(back pt(T)) = gamb(norm pt(T)) for any consistent decision 
tree T , then opt satisfies Property 1 1 01 

Proof. Let A and B be non-empty events such that AflB / and AnB ^ 0, 
let ^ be a non-empty finite set of A n B-consistent gambles, and let Z be an 
A n ^-consistent gamble. Let T — AT\ AT2 be a consistent decision tree, where 
ev(T) = B, gamb(Xi) = X, and gamb(T2) = {%}■ The existence of T is assured by 
A (~l B-consistency of A" and A n ^-consistency of {Z}. By assumption, 

gamb(norm opt (T)) = gamb(back opt (T)) 

= gamb(norm op t (A backopt (Ti) AT 2 )). 
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From Eq. ((6]), we have gamb(norm op t(T)) = opt(gamb(T)|B). Similarly, with 
repeated applications of Eq. ([6]) and Eq. {7J, 

garnb(norm opt (Abackopt(Ti) &AT 2 )) = opt(gamb(Aback opt (Ti) AT 2 )\B) 



Finally we note that gamb(T) = AX © AZ, and gamb(Ti) = X. Therefore, 
opt(A* ®~AZ\B) = opt(Aopt(A'|An J B) © AZ\B) C Aopt{X\A n B) © ~AZ. 



We now prove Theorem 1251 

Proof of Theorem [25l "only if" . By Lemmas [5lJ [52j and [53l we see that satisfying 
backward induction implies Properties [TUl and [TT] Lemma [45] completes the 
proof. 

"if" . We prove this part by structural induction on the tree. In the base step, 
we prove that the implication holds for consistent decision trees which consist of 
only a single node. In the induction step, we prove that if the implication holds for 
the subtrees at every child of the root node, then the implication also holds for the 
whole tree. 

First, if the decision tree T has only a single node, and hence, a reward at the 
root and no further children, then by definition (Eq. (|4a|) in particular) we have 
back opt (T) = norm opt (T). 

Next, suppose T is consistent and has a chance node as its root, that is, T = 
0»=i EiTi- By the induction hypothesis, we know that for every Ti, 

(25) gamb(back opt (T l )) = gamb(norm opt (T l )). 

We show that back opt (T) = norm opt (T). By Lemma |4T)1 it therefore suffices to 
show that gamb(norm opt (T)) = gamb(back opt (T)). By Eq. ^ and the definition 
of gamb, 



opt(Agamb(back opt (Ti)) ®AZ\B) 
opt(Agamb(norm opt (Ti)) ®AZ\B) 
opt(Aopt(gamb(Ti)|A n B) ®AZ\B). 



a 



gamb(norm opt (T)) = opt(gamb(T)|ev(T)) 
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and by Eq. f[25]) , Eq. ((6]) , and the definition of gamb, 

gamb(back opt (T)) = gamb ^norm opt ^ Q E, back opt (T! i ) 

= opt gamb Q^iback pt(r, 



i=l 



ev(T) 

= opt ^ 0£ ig amb(back opt (Ti)) ev(T)^ 
= ^ ^0^igamb(norm O p t (Ti)) ev(T) j 
= opt ( opt(gamb(T 4 )|ev(T) n £ 



i=l 



r{T) 



whence equality follows from Lemma H51 

Finally, suppose that the root of the consistent tree T is a decision node, that is 
T = UILi ^i- We show that back opt (T) = norm opt (r). By Lemma [501 it suffices to 
show that gamb(back opt (T)) = gamb(norm opt (T)). Indeed, 



gamb(norm opt (T)) = opt(gamb(T)|ev(T)) 



opt ^gamb^[_jT, 
opt ( (J gamb(T 1 



ev(T) 
ev(T)V 



and, 

gamb norm opt |_J back op t(T! i ) 



> i=l 



= opt 
= opt 
= opt 
= opt 
= opt 



/ / n 

gamb |_J back pt(T 4 

V \i=l 
/ n 

|J gamb(back opt (T: i )) 

\ i=l 
/ n 

(J gamb(norm pt(Ti)) 

V i=l 
/ n 

(Jopt(gamb(T l )|ev(T 4 )) 

\i=l 
/ n 

Uopt(gamb(T,;)|ev(T)) 

\ i=l 



ev(T)J 
ev(T)j 
ev(T)j 
ev(T)J 
ev(T)Y 



whence equality follows by Lemma 1451 and Property [TT] 

Concluding, we have shown that the implication holds for consistent decision 
trees consisting of a single nodes, and that if the implication holds for all children 
of the root node then it also holds for the whole tree. By induction, the implication 
holds for any consistent decision tree. □ 
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A.ll. Proof of Lemma 

Proof. Assume Property [2] holds. 

Consider any non-empty finite sets of normal form decisions X and y, and any 
event A / 0, such that opt(X\A) C y C A". By Property [2J it follows that 
opt(3>|A) = opt | A) n y, which is equal to apt(X\A) because opt(X\A) C y. 
This proves Property [8] 

Consider any non-empty finite sets of normal form decisions X and y, and any 
event A^% such that y C X. If opt(A?|A) n ^ = 0, then obviously opt(^|A) 3 
opt(X\A) n If not, then opt(y\A) = opt(X\A) n So, Property 1 follows. □ 



A.12. Proof of Theorem | 

Lemma 54. Lei norm be any normal form operator. Let T be a consistent decision 
tree. If, 

(i) for all nodes K £ ch(T) such that K is in at least one element o/norm(T), 

stif(norm(T)) C norm(st*r(T)), 

(ii) and, for all nodes K G ch(T), and all nodes L £ stjf(T) such that L is in 
at least one element of norm(stx(T)), 

sti(norm(st*:(T))) C norm(st L (st A '(T))), 

then, for all nodes N in T such that N is in at least one element o/norm(T), 

stAr(norm(T)) C norm(stjv(T)). 

Proof. If N is the root of T, then the result is immediate. If N £ ch(T), then the 



result follows from (i) Otherwise, N must be in st k(T) for one K e ch(T). 

By assumption, there is a U 6 norm(T) that contains N (and of course also K). 
Therefore, U 6 stx(norm(T)), and by [(i)] stx(U) S norm(stjf(T)), and so N is 
also in at least one element of norm(st k(T)). 

We use the fact that, if U and V are sets of normal form decisions such that 



U C V, then for any node N, stjv(^0 C stjv(V). Combining everything, by (i) 
stjv(st*:(norm(T))) C stjv(norm(st K (T))) 



hence, since N is in at least one element of norm(st A (T)), by (ii) we have 

C norm(stjv(stx(T 1 ))), 

whence the desired result follows, since stjv(st.R- (T)) = stjy(T). □ 

The following results are very similar to Lemmas [HIJ HH and [531 

Lemma 55. If norm opt is subtree perfect for normal form decisions, then opt 
satisfies Property^ 

Proof. Let A and B be non-empty events, and X, Z be non-empty finite sets of 
gambles, such that the following properties hold: AflB^fl, AflB/(3, <Y 
is A n B-consistent, Z is A n -B-consistent, and there are A, F £ <Y such that 
AX = AY, X £ opt(A'|A n B), and AX ®AZ £ opt (AY © AZ\B) for at least 
one Z £ Z. If it is not possible to construct such a situation, then opt satisfies 
Property [7] automatically. Otherwise, to prove that Property[7]holds, we must show 
that Y £ opt(X\ADB). 
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Consider a consistent decision tree T = AT1QAT2, where ev(T) = B, gamb(Ti) = 
X, and gamb(T 2 ) = Z. Since X is An B-consistent and Z is A n inconsistent, we 
know from Definition [5] that there is such a T. By Lemma [33] there is a normal 
form decision ?7 £ nfd(T\) such that gamb(t/) = {Y}, and a normal form decision 
in V £ nfd(T 2 ) such that gamb(V) = {Z}. 

Since, by assumption, AX — AY, obviously AX © AZ = AY © AZ, and hence, 
also AY © AZ £ opt(^4,Y © ~AZ\B). Therefore, by definition of norm opt , AU 
AV £ norm opt (T). In particular, U £ stAr(norm opt (T)), where N is the root of T\. 
Because norm opt is subtree perfect for normal form decisions, it follows that also 
U £ normopt (stjv(T)) = norm opt (Ti). Again applying the definition of norm opt , we 
conclude that indeed Y £ opt(X\A n B). □ 

Lemma 56. If norm op t is subtree perfect for normal form decisions, then opt 
satisfies Property^ 

Proof. Let A be a non-empty event, and let X be a non-empty finite set of A- 
consistent gambles. Let y be a non-empty subset of X. Let T = T\ U T2, where 
ev(T) = A, gamb(Ti) = y, and gamb(T 2 ) = X. Let N be the root of T x . 

If opt (A" I A) n ^ = then Property U holds automatically. Suppose opt (X | A) D 
y ^ 0. By definition of norm opt , N appears in at least one element of norm opt (r), 
and 

opt(X\A) ny = gamb(st w (norm opt , (T))), 

and by subtree perfectness of normal form decisions 

C gamb(norm op t(Ti)) 
= opt(y\A). 

a 

Lemma 57. If norm opt is subtree perfect for normal form decisions, then opt 
satisfies Provertv \l(A 

Proof. Let A and B be non-empty events such that A n B ^ and A n -B 7^ 0, let 
A be a non-empty finite set of A n i?-consistent gambles, and let Z be an A n in- 
consistent gamble. Let T = AT1QAT2 be a consistent decision tree, where ev(T) = 

gamb(Ti) = A, and T 2 is simply a normal form decision with gamb(T 2 ) = {Z}. 
The existence of T is assured by A n B-consistency of <Y and A n S-consistency of 
{Z}. Let TV be the root of Ti. 

Consider any gamble AX (BAZ £ opt (AX ®AZ\B). By Lemma l42l there is 
a U £ nfd(Ti) such that gamb({7) = X. By definition of norm opt , it follows that 
AU AT2 £ norm opt (T), and hence, in particular, U £ stAr(norm opt (T)). By 
subtree perfectness for normal form decisions, U £ norm opt (Ti). Again applying 
the definition of norm opt , we find that X £ opt(X\A n B), thus indeed AX © AZ £ 
Aopt(X\A n B) © AZ, whence Property [10] is established. □ 

We can now prove Theorem [53 

Proof of Theorem "only if" . Follows from Lemmas [55l [551 and [57] 

"if". We proceed as usual by structural induction. The base step is trivial as 
usual. Let ch(T) = {Ki, . . . ,K n } and let Ti — st^T). The induction hypothesis 
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says that norm op t is subtree perfect for normal form decisions on all Ti. More 
precisely, for all Ti, and for every L that is in at least one element of norm pt(Ti), 

st L (norm op t(Ti) C norm opt (st L (T)). 

We must show that, for any N in at least one element of norm opt (T), 

st w (norm(T)) C norm(stiv(T)). 

By the induction hypothesis and Lemma [5H it suffices to show this only for N G 
ch(T), that is, to show that 

(26) st K! (norm opt (T)) C norm opt (T 4 ) 

for each i such that Ki is in at least one element of norm op t(T). 

Suppose the root of T is a decision node, so T — |J"=i ^i- Let U be an element of 
norm opt (T). There is a j such that Kj is in U; let Uj denote st^-., (U). To establish 
Eq. (|26[) we must show that Uj £ norm opt (Tj). 

Note that gamb^) = gamb(J7) C opt(gamb(T)|ev(T)) since U e norm opt (T). 
Obviously, also ga,vab(Uj) G gamb(T,,) by definition of gamb. Hence, it must hold 
that 

gamb(L7 J ) C opt(gamb(T)|ev(T)) n gamb(Tj), 

but, also, because gamb(Tj) C gamb(T), and once noted that ev(T) = ev(Tj), it 
follows from Property |H] that 

Copt(gamb(T 3 )|ev(r,)) 

Putting everything together, we confirm that Uj 6 norm op t (Tj ) . This proves the 
induction step for decision nodes. 

Now suppose that the root of T is a chance node, so T — 0™ =1 E{Fi. Again, 
let U = 0™ =1 EiUi s norm opt (T). To establish Eq. (|2"o]) we must show that Ui £ 
norm opt (Ti) for all i. 

Indeed, since U € norm opt (T), 

gamb(C7) e opt(gamb(T)) = opt ^ E l gamb(T l ) ev(T)^J 
so by Lemma [47l 

C0£i opt(gamb(T l )|ev(T) n E t ). 

So, for each T^, there is a normal form decision V\ G norm opt (Ti) such that Ei gamb(T^) 
Eigaxnb(Ui). Can we apply Property [7]? 

Obviously, {gamb(Vi), gamb(E/i)} C gamb(Tj), and 

Ei gamb(VS) ®E~Z = gamb(C7) 

C opt(0^gamb(T l )|ev(T)) = opt(B< gamb(T;) © ^Z|ev(T)) 

for suitable choices for Z and a, Z E Z. Therefore we can apply Property [7J to 
conclude that Ui is in norm opt (Tj) for each i. This proves the induction step for 
chance nodes. □ 
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A.13. Proofs for Section EHJ 

Proof of Lemma \3(A If N is in at least one element of norm(T) but does not appear 
in ext(T), then N will not appear in any element of nfd(ext(T)) and equivalence 
will fail. If a node M is in ext(T) but not in any element of norm(T), then M will 
appear in at least one element of nfd(ext(T)) and equivalence will fail. Therefore a 
node is in ext(T) if and only if it is in at least one element of norm(T). □ 

Proof of Lemma [31\ "if" . Suppose ext is subtree perfect and a node N is in ext(T), 
then N is in at least one element of norm(T). Because nfd and stN commute, we 
have 

norm(st w (T)) = nfd(ext(stjv(T))) = nfd(stjv(ext(T))) 
= stjy(nfd(ext(T))) = stjv(norm(T)). 

This demonstrates subtree perfectness of norm. 

"only if" . By Lemma 1301 for a particular norm there can be no more than one 
equivalent ext. We show that this ext is subtree perfect. A node N is in this ext 
if and only if it is in at least one element of norm(T). Similarly, a node M is in 
stjv(ext(T)) if and only if M is in at least one element of st,/v(norm(T)). By subtree 
perfectness of norm, the latter is satisfied if and only if M is in at least one element 
of norm(stAr(T)). But, again by definition of ext, the latter is satisfied if and only 
if M is in ext(stAr(T)). This establishes subtree perfectness of ext. □ 

Proof of Theorem \3 6 A By Lemma I31[ if an equivalent ext exists then it is subtree 
perfect. We must show that the ext constructed in Lemma l30l satisfies 

norm opt (T) = nfd (ext (T)) 

for all consistent decision trees T. 

We now proceed by structural induction. The base step, that nfd(ext(T)) = 
norm op t(T) for any decision tree comprising only a single node, is as usual satisfied 
trivially. 

Let us proceed with the induction step. The induction hypothesis states that, 
for any node K in ch(T), nfd(ext(st^(T))) = norm opt (stx(7 1 )). We must show that 
norm opt (T) = nfd (ext (T)). 

It is useful to show first that, if JC is the set of all K € ch(T) that appear in at 
least one element of norm opt (r) (or equivalently, that appear in ext(T)), then for 
any K £ IC, 

(27) st ic (nfd(ext(T))) = st K (norm opt (T)). 

Consider any node K in ch(T) that appears in ext(T). Clearly, 
stx(nfd(ext(T))) = nfd(st K (ext(T))) 

and since we just proved that ext is subtree perfect, 

= nfd(ext(stif(T))) 

and by the induction hypothesis, 



= norm opt (st if (T)) 
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but, by definition of ext, the node K also appears in at least one element of 
norm op t (T) , so by the subtree perfectness of norm opt , 

= stif (normopt (T)). 

This establishes Eq. ([2"T1) . 

Now, suppose that the root of T is a decision node. Observe that 

nfd(ext(T)) = [J nfd(st K (ext(T))), 
kek 

and since st(-) and nfd(-) commute, 

= [J st K (nfd(cxt(T))). 
kgK 

Also, because opt satisfies Properties [T] and [2l we have (as seen in the proof of 
Theorem [221) 

norm opt (T) = |_| st^ (norm opt (T)), 
KeK 

whence by Eq. (f27|) . 

norm op t (T) = nfd(ext(T)). 
Finally, suppose that the root of T is a chance node. Here, /C is simply ch(T) = 
{Ki, . . . , K n }. Similarly to before, we have 

n 

nfd(ext(T)) = Q£; i nfd(stK i (ext(T))) 

i=l 
n 

= 0^ st ^( nfd ( cxt ( T )))- 

i=l 

Since opt satisfies Properties [U [21 and [31 we have (as seen in the proof of Theo- 
rem^, 

n 

norm op t (T) = Q ^ norm op t(st i f i (T)) 

4=1 

n 

= QEi st Ki (norm pt (T)), 

i=l 

whence by Eq. (|2"T)) . we have 

nfd(cxt(T)) = normopt (T). 

□ 
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